Host cell protein knock-out cells for production of therapeutic proteins

ABSTRACT

The present invention relates to methods and means for making Vitamin K-dependent protein compositions which are devoid or substantially devoid of protein contaminants. In particular, methods and means useful for the reduction or elimination of protein contaminants also being Vitamin K-dependent proteins are described.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a 35 U.S.C. § 371 national stage application of International Patent Application PCT/EP2006/064220 (published as WO 2007/006808 A1), filed Jul. 13, 2006, which claimed priority of European Patent Application 05106401.2, filed Jul. 13, 2005; this application further claims priority under 35 U.S.C. § 119 of U.S. Provisional Application 60/706,369, filed Aug. 8, 2005.

FIELD OF THE INVENTION

The present invention relates to methods for producing compositions comprising Vitamin K-dependent protein having a very low or negligible content of protein contaminants and to compositions derived from such methods. Such methods may either be used alone or in combination with other methods for the purpose of reducing the relative content of protein contaminants. The present invention is particularly relevant in the preparation of compositions of coagulation factors selected from Thrombin polypeptides (FII/FIIa), Factor X polypeptides (FX/FXa), Factor IX polypeptides FIX/FIXa), Factor VII polypeptides (FVII/FVIIa), and the anticoagulant Protein C, in particular Factor VII polypeptides.

BACKGROUND OF THE INVENTION

In the production of recombinant proteins from cultures of microorganisms or cell lines, the final production step is the recovery and optionally the concentration of the product of interest. Culture media in which the cells have been grown and which contain secreted proteins, and, in particular, cell lysates containing intracellular proteins of interest also contain, to a greater or lesser extent, other proteins produced by the cells, apart from other contaminants, such as media components, nucleic acids and the like. In order to obtain a purified protein product, it is therefore necessary to separate the protein of interest from other proteins and polypeptides and other impurities in the crude material containing the protein of interest. It is however, often difficult to remove protein contaminants comprising domains of the same nature as the polypeptide of interest.

Vitamin K-dependent proteins are distinguished from other proteins by sharing a common structural feature in their amino terminal part of the molecule. The N-terminal of these proteins, also referred to as the Gla-domain, is rich in the unusual amino acid γ-carboxy glutamic acid which is synthesized from glutamate in a Vitamin K dependent reaction catalysed by the enzyme γ-glutamyl carboxylase. Because of the presence of about 9 to 12 Gla residues, the Gla-domain is characterised by being capable of binding divalent cations such as Ca²⁺. Upon binding of metal ions, these proteins undergo conformational changes which can be measured by several techniques such as circular dichroism and fluorescence emission.

The discovery of metal induced conformational changes of Gla-containing proteins (Nelsestuen et. al., J. Biol. Chem. 1976; 251, 6886-6893) together with identification of conformation specific polyclonal antibodies (Furie et al., J. Biol. Chem. 1978; 253, 8980-8987) opened the way for the introduction of conformation specific immunoaffinity chromatography. These antibodies could recognise and bind the Gla-domain in the presence of Ca²⁺ ions but released the protein upon removal of Ca²⁺ ions using a Ca²⁺ chelator such as EDTA or citrate.

In 1980's conformation specific pseudoaffinity chromatography was developed making use of the unique property of Gla containing proteins to undergo metal induced changes in conformation. Pseudoaffinity chromatography differs from the conventional affinity chromatography in that there is no immobilized affinity ligand involved and it is performed on a conventional chromatographic matrix (Yan S. B., J. Mol. Recog. 1996; 9, 211-218). The Gla protein can be adsorbed to an anion exchange material by eliminating divalent metal ions. Subsequently, elution is performed by adding Ca²⁺ to the elution buffer.

In 1986, Bjørn and Thim reported purification of rFVII on an anion exchange material taking advantage of Ca²⁺-binding property of Gla-domain of FVII (Bjørn S, and Thim L., Research Disclosure, 1986, 26960-26962.). Adsorption was achieved in a buffer without Ca²⁺ and elution of FVII was possible using a Ca²⁺ containing buffer with low ionic strength and under mild conditions. Yan et al. have used the same principle for the purification of recombinant human Protein C (Yan S. B. et al., Bio/technology. 1990; 8, 655-661).

Brown et al. (Brown et al., J. Biol. Chem. 2000; 275, 19795-19802.) have reported monoclonal antibodies specific for Gla residues. These antibodies could recognize all of the Gla proteins tested: Factor VII, Factor IX, Factor II, Protein C, Protein S, GAS-6, bone matrix Gla protein, conantokin G. Several conformational specific antibodies raised against one Gla protein show cross reactivity with other Gla proteins (Furie B. and Furie B., J. Biol. Chem. 1979; 254, 9766-9771; Church et al., J. Biol. Chem. 1988; 263, 6259-6267).

While the presence of the Gla-domain provides an advantage for separation of Gla containing proteins from other proteins, the inventors of present invention observed that similar properties and behaviour of the Gla containing proteins makes it difficult to separate them from each other.

Proteins with a Gla-domain comprise the following proteins: GAS-6, Protein S, Factor II (Prothrombin), Factor X, Factor IX, Protein C, Factor VII, Protein Z, Transmembrane gamma-carboxyglutamic acid protein 1, Transmembrane gamma-carboxyglutamic acid protein 2, Transmembrane gamma carboxyglutamic acid protein 3, Transmembrane gamma-carboxyglutamic acid protein 4, Bone Gla protein, Matrix Gla protein, and Osteocalcin.

The need for efficiently separating a Vitamin K-dependent protein of interest, such as a Gla-domain containing polypeptide of interest, from protein contaminants is a particularly relevant issue when dealing with the purification of such polypeptides produced in cell cultures, because the host cell may produce significant amounts of protein contaminants that may cause undesirable immunogenic reactions upon use of the polypeptide.

SUMMARY OF THE INVENTION

The present invention relates in a broad aspect to the generation of compositions comprising a Vitamin K-dependent protein of interest which is devoid or substantially devoid of at least one protein contaminant expressed by the host cell.

Thus in a first aspect the present invention relates to a host cell expressing a Vitamin K-dependent protein of interest, the host cell being modified to express a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of the modification. In one embodiment the host cell is transfected with a polynucleotide construct to encode the Vitamin K-dependent protein of interest.

The term “modified” as used herein refers to a cell that has been engineered by any man-made molecular or cell biology techniques or process useful in the industry.

In a second aspect the present invention relates to a method for producing a host cell according to the invention, the method comprising the following steps in any order:

-   -   a) optionally transfecting the host cell with a polynucleotide         construct encoding a Vitamin K-dependent protein of interest;         and     -   b) modifying the host cell to express a substantially lower         amount of at least one protein contaminant expressed endogenous         by the host cell in the absence of the modification.

In a further aspect the present invention relates to a method for producing a composition comprising a Vitamin K-dependent protein of interest with a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of modification, the method comprising the steps of growing a host cell expressing a Vitamin K-dependent protein of interest, the host cell being modified to express a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of the modification, in a growth medium and harvesting the growth medium comprising the Vitamin K-dependent protein of interest.

In a further aspect the present invention relates to a method for producing a composition comprising a Vitamin K-dependent protein of interest with a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of modification, the method comprising the steps of:

-   -   a) producing a host cell according to the invention; and     -   b) growing the host cell in a growth medium and harvesting the         growth medium comprising the Vitamin K-dependent protein of         interest.

In a further aspect the present invention relates to a composition produced by a method for producing a composition comprising a Vitamin K-dependent protein of interest with a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of modification, the method comprising the steps of growing a host cell expressing a Vitamin K-dependent protein of interest, the host cell being modified to express a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of the modification, in a growth medium and harvesting the growth medium comprising the Vitamin K-dependent protein of interest.

In a further aspect the invention relates to modified cells expressing a Vitamin K-dependent protein of interest useful for generating compositions comprising a Vitamin K-dependent protein of interest, devoid or substantially devoid of protein contaminants expressed by the host cell.

In a further aspect the invention relates to methods for reducing or eliminating the content of at least one protein contaminant in a composition comprising a Vitamin K-dependent protein of interest wherein at least one protein contaminant expressed by the host cell is inhibited.

In a further aspect the invention relates to new nucleic acid sequences encoding protein S in CHO cell.

In a further aspect the invention relates to a new amino acid sequence of protein S in CHO cell.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the RansiRNA vector.

The vector is composed of two polymerase III promoters transcribing the siRNA template in each direction. The two RNA transcripts are complementary and anneal to form the final siRNA molecule. The vector contains a hygromycin resistance gene which makes it possible to select for stable cell clones.

FIG. 2 illustrates steps in the Gene targeting method.

In the CHO Protein S gene targeting construct the coding part of exon 1 has been exchanged by a hygromycin or a blasticidin resistance gene for positive selection. Furthermore, the TK gene is inserted next to exon 2 for negative selection. Two cre/lox sites are flanking the resistance gene. Following homologous recombination the cell population can be screened using primers specific to promoter region outside the construct and to the resistance gene in the construct. Once the alleles have been knocked-out for wildtype Protein S, the cells may be transfected by an expression plasmids containing Cre recombinase. The Cre recombinase will recombinate at the cre/lox sites and resistance genes are deleted from the cell genome.

FIG. 3 illustrates down regulation of the Protein S gene in CHO-K1 cells using the synthetic made gene ZNF-PS.

FIG. 3 a: The synthetic gene ZNF-PS downregulates Protein S transcription in CHO-K1 cells, determined by luciferase reporter assay. The figure shows luciferase readout from a reporter containing the Protein S promoter. The pRL-CMV (Promega, Madison) vector was used as control for transfection efficiency. ZNF-PS down regulates Protein S promoter activity by 50% in a transient transfection.

FIG. 3 b: The synthetic gene ZNF-PS downregulates Protein S transcription in CHO-K1 cells, determined by real-time PCR on Protein S mRNA. The figure illustrates a realtime PCR quantitation of the Protein S mRNA in CHO-K1 transiently transfected with ZNF-PS. The pEGFP (Clontech, Mountain View) vector was used as control for transfection efficiency. In this experiment ZNF-PS also down regulates Protein S 50%.

FIG. 4: The Protein S gene is localized onto two different chromosomes in the same metaphase of CHO-K1 cells. The figure illustrates Protein S gene localization in the CHO-K1 genome. FISH was per-formed on CHO-K1 chromosomes using Protein intron 1 as probe.

FIG. 5: Two zinc finger proteins fused to nucleases bind inside exon1 of the CHO Protein S gene. The figure illustrates DNA binding specificity of two zinc finger proteins fused to Fok I nuclease.

The left zinc finger protein is expected to bind to 5′-GTCCTGAGC-3′ (upper strand) and the right zinc finger will bind to 5′-GCTGGTATG-3′ (upper strand) both sequence element is harbored by Protein S exon 1. The two zinc finger are either fused to Fok I og Sts I nucleases, the nucleases will homodimerize and perform the cleavage of the DNA strands.

FIG. 6: Gene targeting by homologous recombination enhanced by zinc finger nuclease cleavage. The figure illustrates the step in homologous recombination enhanced by zinc finger nucleases.

The zinc finger nucleases will bind their specific binding sites within Protein S exon 1 and cleave the DNA strands. The gene targeting vector transfected along with the nucleases contains a large fragment identical to the Protein S gene, on each side of the EGFP gene. Recombination occurs between the Protein S gene and targeting vector. Recombinant cells can be sorted due to EGFP expression.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a host cell for the production of recombinant proteins, wherein this host cell is modified to express a substantially lower amount of at least one protein contaminant expressed endogenous by the host cell in the absence of the modification.

It will be understood that any method or technique for reducing expression of the contaminating protein may be used. The examples of such methods including siRNA targeting, targeted gene knock-out, transfection with a transcriptional factor, and site-specific cleavage of the DNA strands encoding protein contaminants are not to be construed limiting in any way. In principle, any molecular biology, cell biology, or selection method may be used to reduce the expression level of a particular protein contaminant. The present invention is particular useful in the situation, where the Vitamin K-dependent protein of interest is very closely related with one or more protein contaminants, such as when the protein contaminant is a second vitamin K-dependent protein. Due to the close relationship between a vitamin K-dependent protein of interest and a protein contaminant, which is a second vitamin K-dependent protein, such protein contaminant may be very difficult remove by purification methods.

The present invention further relates to compositions comprising Vitamin K-dependent proteins of interest devoid or substantially devoid of at least one protein contaminant expressed by a host cell.

In one embodiment of the invention, the Vitamin K-dependent protein of interest is selected from the group consisting of GAS-6, Protein S, Factor II (Prothrombin), Factor X, Factor IX, Protein C, Factor VII, Protein Z, Transmembrane gamma-carboxyglutamic acid protein 1, Transmembrane gamma-carboxyglutamic acid protein 2, Transmembrane gamma carboxyglutamic acid protein 3, Transmembrane gamma-carboxyglutamic acid protein 4, Bone Gla protein, Matrix Gla protein, and Osteocalcin. The Vitamin K-dependent proteins may be in either an activated or a non-activated form, such as Factor II and Factor IIa, and Factor X and Factor Xa.

In one embodiment of the invention the Vitamin K-dependent protein of interest is a coagulation factor, such as e.g. FVII or FVIIa polypeptides. In one embodiment the Vitamin K-dependent protein of interest is wild type human FVIIa.

In one embodiment of the invention, the protein contaminants is a second different Vitamin K-dependent protein. Thus, the protein of interest and the protein contaminant may both be a Vitamin K-dependent protein.

In one embodiment of the invention the protein contaminants is Protein S. In one embodiment, the protein contaminants is hamster Protein S.

In one embodiment of the invention the host cell is selected from the group consisting of CHO cells, 293 (HEK293) cells, BKH cells, HKB11 cells, SP2/0 cells, and NS0 cells.

The present invention furthermore relates to a host cell expressing a Vitamin K-dependent protein of interest, which host cell comprises a siRNA construct targeting at least one protein contaminant expressed by the host cell.

The term “siRNA” as used herein refers to small interfering RNA, sometimes known as short interfering RNA or silencing RNA known in the art of molecular biology.

In one embodiment the host cell has been modified by transfection with at least one siRNA polynucleotide construct targeting a mRNA encoding a protein contaminant expressed endogenous by the host cell.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprises a siRNA construct targeting at least one protein contaminant expressed by the host cell, wherein the protein contaminant is a second vitamin K-dependent protein.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprises a siRNA construct targeting at least one protein contaminant expressed by the host cell, wherein the protein contaminant is Protein S.

The present invention also relates to a cell expressing a Vitamin K-dependent protein of interest comprising a disrupted gene for at least one protein contaminant expressed by the host cell.

In one embodiment the host cell has been modified by disruption by gene knock-out of at least one endogenous gene encoding a protein contaminant expressed endogenous by the host cell. In one embodiment the endogenous gene encoding Protein S has been disrupted by gene knock-out of exon 1.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprises a disrupted gene for at least one protein contaminant expressed by the host cell, wherein the protein contaminant is a second vitamin K-dependent protein.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprises a disrupted gene for at least one protein contaminant expressed by the host cell, wherein the protein contaminant is Protein S.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprises a disrupted gene for Protein S, wherein the Protein S gene is disrupted by omission of exon 1.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest has been modified by transfection with at least one transcription factor binding to a DNA element of the gene encoding the protein contaminant expressed endogenous by the host cell.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest has been modified by transfection with at least one nuclease fusion protein for site-specific cleavage of the DNA strands encoding the protein contaminant expressed endogenous by the host cell.

The present invention furthermore relates to a cell expressing a Vitamin K-dependent protein of interest comprising a transcription factor binding to at least one protein contaminant expressed by the host cell.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprising a transcription factor binding to the DNA sequence encoding at least one protein contaminant, the protein contaminant is a second vitamin K-dependent protein.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest comprises a transcription factor binding to the DNA sequence encoding at least one protein contaminant, the protein contaminant is Protein S.

In one embodiment of the invention the transcription factor is a Zinc finger protein. In one embodiment of the invention the Zinc finger protein binds a DNA element comprising the sequence of SEQ ID NO 35.

In one embodiment of the invention the Zinc finger protein binds the GGAGAGGAGGGGGGG DNA element.

In one embodiment the host cell expressing a Vitamin K-dependent protein of interest is modified by random mutagenesis for disruption of at least one endogenous gene encoding a protein contaminant expressed endogenous by the host cell.

The present invention also relates to a method for reducing the content of at least one protein contaminant in a composition comprising a Vitamin K-dependent protein of interest, wherein at least one protein contaminant expressed by the host cell is inhibited.

In one embodiment the method for reducing the content of at least one protein contaminant in a composition comprising a Vitamin K-dependent protein of interest, wherein at least one protein contaminant expressed by the host cell is inhibited is a method comprising the use of siRNA.

In one embodiment the method for reducing the content of at least one protein contaminant in a composition comprising a Vitamin K-dependent protein of interest, wherein at least one protein contaminant expressed by the host cell is inhibited is a method comprising the use of Random mutagenesis.

In one embodiment the method for reducing the content of at least one protein contaminant in a composition comprising a Vitamin K-dependent protein of interest, wherein at least one protein contaminant expressed by the host cell is inhibited is a method comprising the use of Targeted knock-out.

The present invention furthermore relates to a nucleic acid sequence comprising the CHO Protein S cDNA sequence having the sequence of SEQ ID NO 3 or any functional fragments thereof.

The present invention also relates to a nucleic acid sequence comprising the CHO Protein S coding sequence having the sequence of SEQ ID NO 4 or any functional fragments thereof.

The present invention relates to an amino acid sequence comprising CHO Protein S sequence having the sequence of SEQ ID NO 5 or any functional fragments thereof.

The methods and means described herein may in principle be applied for generating compositions comprising any Vitamin K-dependent protein of interest which is devoid or substantially devoid of protein contaminants.

“Polypeptides” means any protein comprising the amino acid sequence of the wild-type protein, as well as their respective “variants”, “related polypeptides”, “derivatives” and “conjugates” thereof.

In particular, as used herein, the terms “Factor VII polypeptide” or “FVII polypeptide” means any protein comprising the amino acid sequence 1-406 of wild-type human Factor VIIa (i.e., a polypeptide having the amino acid sequence disclosed in U.S. Pat. No. 4,784,950), variants thereof as well as Factor VII-related polypeptides, Factor VII derivatives and Factor VII conjugates. This includes FVII variants, Factor VII-related polypeptides, Factor VII derivatives and Factor VII conjugates exhibiting substantially the same or improved biological activity relative to wild-type human Factor VIIa.

The term “Factor VII” is intended to encompass Factor VII polypeptides in their uncleaved (zymogen) form, as well as those that have been proteolytically processed to yield their respective bioactive forms, which may be designated Factor VIIa. Typically, Factor VII is cleaved between residues 152 and 153 to yield Factor VIIa.

Variants of Factor VII may exhibit different properties relative to human Factor VII, including stability, phospholipid binding, altered specific activity, and the like.

“Factor VII” or “Factor VIIa” within the above definition also includes natural allelic variations that may exist and occur from one individual to another. Also, degree and location of glycosylation or other post-translation modifications may vary depending on the chosen host cells and the nature of the host cellular environment.

As used herein, “wild type human FVIIa” is a polypeptide having the amino acid sequence disclosed in U.S. Pat. No. 4,784,950.

The term “Factor VII derivative” as used herein, is intended to designate a FVII polypeptide exhibiting substantially the same or improved biological activity relative to wild-type Factor VII, in which one or more of the amino acids of the parent peptide have been genetically and/or chemically and/or enzymatically modified, e.g. by alkylation, glycosylation, PEGylation, acylation, ester formation or amide formation or the like. This includes but is not limited to PEGylated human Factor VIIa, cysteine-PEGylated human Factor VIIa and variants thereof. Non-limiting examples of Factor VII derivatives includes GlycoPegylated FVII derivatives as disclosed in WO 03/31464 and U.S. patent applications U.S. 20040043446, U.S. 20040063911, U.S. 20040142856, U.S. 20040137557, and U.S. 20040132640 (Neose Technologies, Inc.); FVII conjugates as disclosed in WO 01/04287, U.S. patent application 20030165996, WO 01/58935, WO 03/93465 (Maxygen ApS) and WO 02/02764, U.S. patent application 20030211094 (University of Minnesota).

The term “improved biological activity” refers to FVII polypeptides with i) substantially the same or increased proteolytic activity compared to recombinant wild type human Factor VIIa or ii) to FVII polypeptides with substantially the same or increased TF binding activity compared to recombinant wild type human Factor VIIa or iii) to FVII polypeptides with substantially the same or increased half life in blood plasma compared to recombinant wild type human Factor VIIa. The term “PEGylated human Factor VIIa” means human Factor VIIa, having a PEG molecule conjugated to a human Factor VIIa polypeptide. It is to be understood, that the PEG molecule may be attached to any part of the Factor VIIa polypeptide including any amino acid residue or carbohydrate moiety of the Factor VIIa polypeptide. The term “cysteine-PEGylated human Factor VIIa” means Factor VIIa having a PEG molecule conjugated to a sulfhydryl group of a cysteine introduced in human Factor VIIa.

Non-limiting examples of Factor VII variants having substantially the same or increased proteolytic activity compared to recombinant wild type human Factor VIIa include S52A-FVIIa, S60A-FVIIa (Lino et al., Arch. Biochem. Biophys. 352: 182-192, 1998); FVIIa variants exhibiting increased proteolytic stability as disclosed in U.S. Pat. No. 5,580,560; Factor VIIa that has been proteolytically cleaved between residues 290 and 291 or between residues 315 and 316 (Mollerup et al., Biotechnol. Bioeng. 48:501-505, 1995); oxidized forms of Factor VIIa (Kornfelt et al., Arch. Biochem. Biophys. 363:43-54, 1999); FVII variants as disclosed in PCT/DK02/00189 (corresponding to WO 02/077218); and FVII variants exhibiting increased proteolytic stability as disclosed in WO 02/38162 (Scripps Research Institute); FVII variants having a modified Gla-domain and exhibiting an enhanced membrane binding as disclosed in WO 99/20767, U.S. Pat. No. 6,017,882 and U.S. Pat. No. 6,747,003, U.S. patent application 20030100506 (University of Minnesota) and WO 00/66753, U.S. patent applications U.S. 20010018414, U.S. 2004220106, and U.S. 200131005, U.S. Pat. No. 6,762,286 and U.S. Pat. No. 6,693,075 (University of Minnesota); and FVII variants as disclosed in WO 01/58935, U.S. Pat. No. 6,806,063, U.S. patent application 20030096338 (Maxygen ApS), WO 03/93465 (Maxygen ApS), WO 04/029,091 (Maxygen ApS), WO 04/083,361 (Maxygen ApS), and WO 04/111,242 (Maxygen ApS), as well as in WO 04/108,763 (Canadian Blood Services).

Non-limiting examples of FVII variants having increased biological activity compared to wild-type FVIIa include FVII variants as disclosed in WO 01/83725, WO 02/22776, WO 02/077218, WO 03/027147, WO 04/029090, WO 05/075635, and European patent application with application number 05108713.8 (Novo Nordisk A/S), WO 02/38162 (Scripps Research Institute); and FVIIa variants with enhanced activity as disclosed in JP 2001061479 (Chemo-Sero-Therapeutic Res Inst.).

Examples of variants of factor VII include, without limitation, P10Q-FVII, K32E-FVII, P10Q/K32E-FVII, L305V-FVII, L305V/M306D/D309S-FVII, L305I-FVII, L305T-FVII, F374P-FVII, V158T/M298Q-FVII, V158D/E296V/M298Q-FVII, K337A-FVII, M298Q-FVII, V158D/M298Q-FVII, L305V/K337A-FVII, V158D/E296V/M298Q/L305V-FVII, V158D/E296V/M298Q/K337A-FVII, V158D/E296V/M298Q/L305V/K337A-FVII, K157A-FVII, E296V-FVII, E296V/M298Q-FVII, V158D/E296V-FVII, V158D/M298K-FVII, and S336G-FVII, L305V/K337A-FVII, L305V/V158D-FVII, L305V/E296V-FVII, L305V/M298Q-FVII, L305V/V158T-FVII, L305V/K337A/V158T-FVII, L305V/K337A/M298Q-FVII, L305V/K337A/E296V-FVII, L305V/K337A/V158D-FVII, L305V/V158D/M298Q-FVII, L305V/V158D/E296V-FVII, L305V/V158T/M298Q-FVII, L305V/V158T/E296V-FVII, L305V/E296V/M298Q-FVII, L305V/V158D/E296V/M298Q-FVII, L305V/V158T/E296V/M298Q-FVII, L305V/V158T/K337A/M298Q-FVII, L305V/V158T/E296V/K337A-FVII, L305V/V158D/K337A/M298Q-FVII, L305V/V158D/E296V/K337A-FVII, L305V/V158D/E296V/M298Q/K337A-FVII, L305V/V158T/E296V/M298Q/K337A-FVII, S314E/K316H-FVII, S314E/K316Q-FVII, S314E/L305V-FVII, S314E/K337A-FVII, S314E/V158D-FVII, S314E/E296V-FVII, S314E/M298Q-FVII, S314E/V158T-FVII, K316H/L305V-FVII, K316H/K337A-FVII, K316H/V158D-FVII, K316H/E296V-FVII, K316H/M298Q-FVII, K316H/V158T-FVII, K316Q/L305V-FVII, K316Q/K337A-FVII, K316Q/V158D-FVII, K316Q/E296V-FVII, K316Q/M298Q-FVII, K316Q/V158T-FVII, S314E/L305V/K337A-FVII, S314E/L305V/V158D-FVII, S314E/L305V/E296V-FVII, S314E/L305V/M298Q-FVII, S314E/L305V/V158T-FVII, S314E/L305V/K337A/V158T-FVII, S314E/L305V/K337A/M298Q-FVII, S314E/L305V/K337A/E296V-FVII, S314E/L305V/K337A/V158D-FVII, S314E/L305V/V158D/M298Q-FVII, S314E/L305V/V158D/E296V-FVII, S314E/L305V/V158T/M298Q-FVII, S314E/L305V/V158T/E296V-FVII, S314E/L305V/E296V/M298Q-FVII, S314E/L305V/V158D/E296V/M298Q-FVII, S314E/L305V/V158T/E296V/M298Q-FVII, S314E/L305V/V158T/K337A/M298Q-FVII, S314E/L305V/V158T/E296V/K337A-FVII, S314E/L305V/V158D/K337A/M298Q-FVII, S314E/L305V/V158D/E296V/K337A-FVII, S314E/L305V/V158D/E296V/M298Q/K337A-FVII, S314E/L305V/V158T/E296V/M298Q/K337A-FVII, K316H/L305V/K337A-FVII, K316H/L305V/V158D-FVII, K316H/L305V/E296V-FVII, K316H/L305V/M298Q-FVII, K316H/L305V/V158T-FVII, K316H/L305V/K337A/V158T-FVII, K316H/L305V/K337A/M298Q-FVII, K316H/L305V/K337A/E296V-FVII, K316H/L305V/K337A/V158D-FVII, K316H/L305V/V158D/M298Q-FVII, K316H/L305V/V158D/E296V-FVII, K316H/L305V/V158T/M298Q-FVII, K316H/L305V/V158T/E296V-FVII, K316H/L305V/E296V/M298Q-FVII, K316H/L305V/V158D/E296V/M298Q-FVII, K316H/L305V/V158T/E296V/M298Q-FVII, K316H/L305V/V158T/K337A/M298Q-FVII, K316H/L305V/V158T/E296V/K337A-FVII, K316H/L305V/V158D/K337A/M298Q-FVII, K316H/L305V/V158D/E296V/K337A-FVII, K316H/L305V/V158D/E296V/M298Q/K337A-FVII, K316H/L305V/V158T/E296V/M298Q/K337A-FVII, K316Q/L305V/K337A-FVII, K316Q/L305V/V158D-FVII, K316Q/L305V/E296V-FVII, K316Q/L305V/M298Q-FVII, K316Q/L305V/V158T-FVII, K316Q/L305V/K337A/V158T-FVII, K316Q/L305V/K337A/M298Q-FVII, K316Q/L305V/K337A/E296V-FVII, K316Q/L305V/K337A/V158D-FVII, K316Q/L305V/V158D/M298Q-FVII, K316Q/L305V/V158D/E296V-FVII, K316Q/L305V/V158T/M298Q-FVII, K316Q/L305V/V158T/E296V-FVII, K316Q/L305V/E296V/M298Q-FVII, K316Q/L305V/V158D/E296V/M298Q-FVII, K316Q/L305V/V158T/E296V/M298Q-FVII, K316Q/L305V/V158T/K337A/M298Q-FVII, K316Q/L305V/V158T/E296V/K337A-FVII, K316Q/L305V/V158D/K337A/M298Q-FVII, K316Q/L305V/V158D/E296V/K337A-FVII, K316Q/L305V/V158D/E296V/M298Q/K337A-FVII, K316Q/L305V/V158T/E296V/M298Q/K337A-FVII, F374Y/K337A-FVII, F374Y/V158D-FVII, F374Y/E296V-FVII, F374Y/M298Q-FVII, F374Y/V158T-FVII, F374Y/S314E-FVII, F374Y/L305V-FVII, F374Y/L305V/K337A-FVII, F374Y/L305V/V158D-FVII, F374Y/L305V/E296V-FVII, F374Y/L305V/M298Q-FVII, F374Y/L305V/V158T-FVII, F374Y/L305V/S314E-FVII, F374Y/K337A/S314E-FVII, F374Y/K337A/V158T-FVII, F374Y/K337A/M298Q-FVII, F374Y/K337A/E296V-FVII, F374Y/K337A/V158D-FVII, F374Y/V158D/S314E-FVII, F374Y/V158D/M298Q-FVII, F374Y/V158D/E296V-FVII, F374Y/V158T/S314E-FVII, F374Y/V158T/M298Q-FVII, F374Y/V158T/E296V-FVII, F374Y/E296V/S314E-FVII, F374Y/S314E/M298Q-FVII, F374Y/E296V/M298Q-FVII, F374Y/L305V/K337A/V158D-FVII, F374Y/L305V/K337A/E296V-FVII, F374Y/L305V/K337A/M298Q-FVII, F374Y/L305V/K337A/V158T-FVII, F374Y/L305V/K337A/S314E-FVII, F374Y/L305V/V158D/E296V-FVII, F374Y/L305V/V158D/M298Q-FVII, F374Y/L305V/V158D/S314E-FVII, F374Y/L305V/E296V/M298Q-FVII, F374Y/L305V/E296V/V158T-FVII, F374Y/L305V/E296V/S314E-FVII, F374Y/L305V/M298Q/V158T-FVII, F374Y/L305V/M298Q/S314E-FVII, F374Y/L305V/V158T/S314E-FVII, F374Y/K337A/S314E/V158T-FVII, F374Y/K337A/S314E/M298Q-FVII, F374Y/K337A/S314E/E296V-FVII, F374Y/K337A/S314E/V158D-FVII, F374Y/K337A/V158T/M298Q-FVII, F374Y/K337A/V158T/E296V-FVII, F374Y/K337A/M298Q/E296V-FVII, F374Y/K337A/M298Q/V158D-FVII, F374Y/K337A/E296V/V158D-FVII, F374Y/V158D/S314E/M298Q-FVII, F374Y/V158D/S314E/E296V-FVII, F374Y/V158D/M298Q/E296V-FVII, F374Y/V158T/S314E/E296V-FVII, F374Y/V158T/S314E/M298Q-FVII, F374Y/V158T/M298Q/E296V-FVII, F374Y/E296V/S314E/M298Q-FVII, F374Y/L305V/M298Q/K337A/S314E-FVII, F374Y/L305V/E296V/K337A/S314E-FVII, F374Y/E296V/M298Q/K337A/S314E-FVII, F374Y/L305V/E296V/M298Q/K337A-FVII, F374Y/L305V/E296V/M298Q/S314E-FVII, F374Y/V158D/E296V/M298Q/K337A-FVII, F374Y/V158D/E296V/M298Q/S314E-FVII, F374Y/L305V/V158D/K337A/S314E-FVII, F374Y/V158D/M298Q/K337A/S314E-FVII, F374Y/V158D/E296V/K337A/S314E-FVII, F374Y/L305V/V158D/E296V/M298Q-FVII, F374Y/L305V/V158D/M298Q/K337A-FVII, F374Y/L305V/V158D/E296V/K337A-FVII, F374Y/L305V/V158D/M298Q/S314E-FVII, F374Y/L305V/V158D/E296V/S314E-FVII, F374Y/V158T/E296V/M298Q/K337A-FVII, F374Y/V158T/E296V/M298Q/S314E-FVII, F374Y/L305V/V158T/K337A/S314E-FVII, F374Y/V158T/M298Q/K337A/S314E-FVII, F374Y/V158T/E296V/K337A/S314E-FVII, F374Y/L305V/V158T/E296V/M298Q-FVII, F374Y/L305V/V158T/M298Q/K337A-FVII, F374Y/L305V/V158T/E296V/K337A-FVII, F374Y/L305V/V158T/M298Q/S314E-FVII, F374Y/L305V/V158T/E296V/S314E-FVII, F374Y/E296V/M298Q/K337A/V158T/S314E-FVII, F374Y/V158D/E296V/M298Q/K337A/S314E-FVII, F374Y/L305V/V158D/E296V/M298Q/S314E-FVII, F374Y/L305V/E296V/M298Q/V158T/S314E-FVII, F374Y/L305V/E296V/M298Q/K337A/V158T-FVII, F374Y/L305V/E296V/K337A/V158T/S314E-FVII, F374Y/L305V/M298Q/K337A/V158T/S314E-FVII, F374Y/L305V/V158D/E296V/M298Q/K337A-FVII, F374Y/L305V/V158D/E296V/K337A/S314E-FVII, F374Y/L305V/V158D/M298Q/K337A/S314E-FVII, F374Y/L305V/E296V/M298Q/K337A/V158T/S314E-FVII, F374Y/L305V/V158D/E296V/M298Q/K337A/S314E-FVII, S52A-Factor VII, S60A-Factor VII; R152E-Factor VII, S344A-Factor VII, T106N-FVII, K143N/N145T-FVII, V253N-FVII, R290N/A292T-FVII, G291N-FVII, R315N/V317T-FVII, K143N/N145T/R315N/V317T-FVII; and FVII having substitutions, additions or deletions in the amino acid sequence from 233Thr to 240Asn; FVII having substitutions, additions or deletions in the amino acid sequence from 304Arg to 329Cys; and FVII having substitutions, additions or deletions in the amino acid sequence from 153Ile to 223Arg.

Thus, substitution variants in a factor VII polypeptide include, without limitation substitutions in positions P10, K32, L305, M306, D309, L305, L305, F374, V158, M298, V158, E296, K337, M298, M298, S336, S314, K316, K316, F374, S52, S60, R152, S344, T106, K143, N145, V253, R290, A292, G291, R315, V317, and substitutions, additions or deletions in the amino acid sequence from T233 to N240 or from R304 to C329; or from 1153 to R223, or combinations thereof, in particular variants such as P10Q, K32E, L305V, M306D, D309S, L305I, L305T, F374P, V158T, M298Q, V158D, E296V, K337A, M298Q, M298K, S336G, S314E, K316H, K316Q, F374Y, S52A, S60A, R152E, S344A, T106N, K143N, N145T, V253N, R290N, A292T, G291N, R315N, V317T, and substitutions, additions or deletions in the amino acid sequence from T233 to N240, or from R304 to C329, or from I153 to R223, or combinations thereof.

“A Vitamin K-dependent protein of interest” as used herein refers to the single Vitamin K-dependent protein product produced by the host cells, which is relevant to obtain in the most pure form. In one embodiment vitamin K-dependent protein of interest is the protein product produced in the highest amount by the host cell. In one embodiment, the Vitamin K-dependent protein of interest in transfected into the host cell.

“Composition” as used herein, means any composition, such as a liquid composition, such as an aqueous liquid composition.

The Vitamin K-dependent protein of interest is most typically one produced under cell culture conditions, i.e. the Vitamin K-dependent protein of interest is either obtained directly as a constituent of a cell culture supernatant, or obtained from a cell culture supernatant after one or more subsequent purification process steps.

Typically, the total content of protein contaminants in the non-purified composition is at least 200 ppm, such as at least 300 ppm, e.g. at least 400 ppm, or at least 500 ppm. Also typically, the total content of Protein S contaminants in the non-purified composition is at least 200 ppm, such as at least 300 ppm, e.g. at least 400 ppm, or at least 500 ppm.

“Protein contaminant” and “protein contaminants” as used herein, means protein or polypeptide constituents produced endogenously by the host cell and constituting an impurity in relation to the Vitamin K-dependent protein of interest. Thus, the Vitamin K-dependent protein of interest is obviously not be counted as a protein contaminant.

“Devoid or substantially devoid” as used herein, refers to a composition wherein the total content of a protein contaminant in the composition is at the most 500 ppm, such as at the most 100 ppm, such as at the most 10 ppm, e.g. at the most 1 ppm, or at the most 0.1 ppm. Also typically, the total content of Protein S contaminants in the composition is at the most 500 ppm, such as at the most 100 ppm, such as at the most 10 ppm, e.g. at the most 1 ppm, or at the most 0.1 ppm.

The phrase “express a substantially lower amount of at least one protein contaminant” as used herein, refers to the expression level of an endogenous protein contaminant, which is reduced by at least 30%, such as by at least 40%, such as by at least 50%, such as by at least 60%, such as by at least 80%, such as by at least 90%, such as by at least 95%, such as by at least 99%.

A particularly relevant class of protein contaminant are proteins very similar to the Vitamin K-dependent protein of interest, such as any other protein containing a Gla-domain including the proteins: GAS-6, Protein S, Factor II (Prothrombin), Factor Xa, Factor IXa, Protein C, Factor VIIa, Protein Z, Transmembrane gamma-carboxyglutamic acid protein 1, Transmembrane gamma-carboxyglutamic acid protein 2, Transmembrane gamma carboxyglutamic acid protein 3, Transmembrane gamma-carboxyglutamic acid protein 4, Matrix Gla protein, and Osteocalcin.

As a non-limiting example, Protein S is sometimes seen as an impurity in the production of recombinant FVIIa in mammalian cells. Protein S is like FVII a Vitamin K-dependent plasma glycoprotein containing an EGF-like domain and a gamma-carboxy-glutamate (Gla) domain. Due to the structural similarity between FVII(a) and Protein S, it is difficult to recover FVII by means of chromatographic methods supra without contamination with Protein S. It would therefore be desirable to prevent the expression of Protein S by the host cell. This may be obtained by targeting the mRNA or the genome.

Stable expression of small interfering RNA, siRNA, is a new technology that enables reduction of targeted mRNA and thus suppression of targeted gene expression in mammalian cells (T. R. Brummelkamp, R. Bernards, and R. Agami. Science 296(5567): 550-553, 2002 & M. Mivaaishi and K Taira. Nat. Biotechnol 20(5):497-200, 2002.) A number of individual siRNA have been generated in a strategy similar to the ones described in the references. Some of these siRNAs have proven useful (Example 2).

The use of random mutagenesis to introduce genomic changes in the host cells, some of which may prevent the generation of mRNA in the host cell may also be exploited. This may be achieved by treating a population of CHO cells with a mutagen such as e.g. Ethyl Methane Sulfonate, EMS, which induces point mutations in the cells. The surviving cells may exhibit altered phenotypes, because of these mutations. The cells may be seeded in a screening format (e.g. 96-well plates) to allow isolation of clonal cell populations. Following a growth period, medium may be harvested from the wells and assayed for Protein S content. Clones without Protein S expression may be isolated and used for production of Protein S-free Factor VII.

Disruption of the genome may be obtained by gene targeting or the knock-out technique (Example 3). The generation of knock-out cells is a well-described technique for eradicating expression of endogenous proteins, and a CHO knock-out cell was recently described in Yamane-Ohnuki et al. Biotechnol. Bioeng. 87 (5):614-622, 2004.

Genomic Protein S knockout plasmid was generated and transfected into CHO cells. By homologous recombination the Protein S gene in the CHO cells was disrupted. This procedure was repeated until all alleles of the Protein S gene was stably removed (Example 3).

Transcription factor engineering for transcriptional down regulation is an alternative way of modifying the gene expression (Example 4).

These methods may in theory be suitable for removing any unwanted host cell protein contaminants. For all of these methods to be applied it requires the knowledge of the gene sequence of the contaminating protein. The sequence of Protein S for Chinese Ovary Hamster, CHO, is not public available and a cloning of CHO Protein S cDNA was performed as described in Example 1 and disclosed as SEQ ID NO 1. The CHO Protein S coding sequence and the CHO Protein S amino acid sequence are disclosed as SEQ ID NO 2 and 3 respectively.

The present invention is further illustrated by the following examples which, however, are not to be construed as limiting the scope of protection. The features disclosed in the foregoing description and in the following examples may, both separately and in any combination thereof, be material for realising the invention in diverse forms thereof.

Embodiments of the Invention

-   1. A composition comprising a Vitamin K-dependent protein of     interest devoid or substantially devoid of at least one protein     contaminant expressed by a host cell. -   2. The composition according to embodiment 1, wherein the Vitamin     K-dependent protein of interest is selected from the group     consisting of GAS-6, Protein S, Factor II (Prothrombin), Factor X,     Factor IX, Protein C, Factor VII, Protein Z, Transmembrane     gamma-carboxyglutamic acid protein 1, Transmembrane     gamma-carboxyglutamic acid protein 2, Transmembrane gamma     carboxyglutamic acid protein 3, Transmembrane gamma-carboxyglutamic     acid protein 4, Bone Gla protein, Matrix Gla protein, and     Osteocalcin. -   3. The composition according to embodiment 1, wherein at least one     of said protein contaminants is a vitamin K-dependent protein. -   4. The composition according to embodiment 1, wherein at least one     of said protein contaminants is Protein S. -   5. The composition according to embodiment 1, wherein the host cell     is selected from the group consisting of CHO cells, 293 (HEK293)     cells, BKH cells, HKB11 cells, SP2/0 cells, and NS0 cells. -   6. A cell expressing a Vitamin K-dependent protein of interest     according to any of embodiments 1-5 further comprising a siRNA     construct targeting at least one protein contaminant expressed by     the host cell. -   7. The cell according to embodiment 6, wherein said at least one     protein contaminant is a vitamin K-dependent protein. -   8. The cell according to any of embodiments 6-7, wherein said at     least one protein contaminant is Protein S. -   9. A cell expressing a Vitamin K-dependent protein of interest     according to any of embodiments 1-5 further comprising a disrupted     gene for at least one protein contaminant expressed by the host     cell. -   10. The cell according to embodiment 9, wherein said at least one     protein contaminant is a vitamin K-dependent protein -   11. The cell according to any of embodiments 9-10, wherein said at     least one protein contaminant is Protein S. -   12. The cell according to any of embodiments 9-11, wherein the     Protein S gene is disrupted by omission of exon 1 -   13. A cell expressing a Vitamin K-dependent protein of interest     according to any of embodiments 1-5 further comprising a     transcription factor binding to at least one protein contaminant     expressed by the host cell. -   14. The cell according to embodiment 13, wherein said at least one     protein contaminant is a vitamin K-dependent protein -   15. The cell according to any of embodiments 13-14, wherein said at     least one protein contaminant is Protein S. -   16. The cell according to any of embodiments 13-15, wherein the     transcription factor is a Zinc finger protein. -   17. The cell according to any of embodiments 15-16, wherein the Zinc     finger protein binds the GGAGAGGAGGGGGGG DNA element. -   18. A method for reducing the content of at least one protein     contaminant in a composition comprising a Vitamin K-dependent     protein of interest wherein at least one protein contaminant     expressed by the host cell is inhibited -   19. The method according to embodiment 18, wherein the method     comprises the use of siRNA. -   20. The method according to embodiment 18, wherein the method     comprises the use of Random mutagenesis. -   21. The method according to embodiment 18, wherein the method     comprises the use of Targeted knock-out. -   22. A nucleic acid sequence comprising the CHO Protein S cDNA     sequence having the sequence of SEQ ID NO 1. -   23. A nucleic acid sequence comprising the CHO Protein S coding     sequence having the sequence of SEQ ID NO 2. -   24. An amino acid sequence comprising CHO Protein S sequence having     the sequence of SEQ ID NO 3.

Further Embodiments of the Invention:

-   1a. A host cell expressing a Vitamin K-dependent protein of     interest, said host cell being modified to express a substantially     lower amount of at least one protein contaminant expressed     endogenous by said host cell in the absence of said modification. -   2a. The host cell according to embodiment 1a, wherein said Vitamin     K-dependent protein of interest is selected from the group     consisting of GAS-6, Protein S, Factor II (Prothrombin), Factor X,     Factor IX, Protein C, Factor VII, Protein Z, Transmembrane     gamma-carboxyglutamic acid protein 1, Transmembrane     gamma-carboxyglutamic acid protein 2, Transmembrane gamma     carboxyglutamic acid protein 3, Transmembrane gamma-carboxyglutamic     acid protein 4, Bone Gla protein, Matrix Gla protein, and     Osteocalcin. -   3a. The host cell according to any one of embodiments 1a-2a, wherein     said protein contaminants is a second vitamin K-dependent protein. -   4a. The host cell according to any one of embodiments 1a-3a, wherein     said protein contaminant is Protein S. -   5a. The host cell according to any one of embodiments 1a-4-a,     wherein the host cell is selected from the group consisting of CHO     cells, 293 (HEK293) cells, BKH cells, HKB11 cells, SP2/0 cells, and     NS0 cells. -   6a. The host cell according to any one of embodiments 1a-5a, wherein     said cell has been modified by transfection with at least one siRNA     polynucleotide construct targeting a mRNA encoding a protein     contaminant expressed endogenous by said host cell. -   7a. The host cell according to any one of embodiments 1a-6a, wherein     said cell has been modified by disruption by gene knock-out of at     least one endogenous gene encoding a protein contaminant expressed     endogenous by said host cell. -   8a. The host cell according to embodiment 7a, wherein the endogenous     gene encoding Protein S has been disrupted by gene knock-out of exon     1. -   9a. The host cell according to any one of embodiments 1a-8a, wherein     said cell has been modified by transfection with at least one     transcription factor binding to a DNA element of the gene encoding     said protein contaminant expressed endogenous by said host cell.     10a. The host cell according to embodiment 9a, wherein said     transcription factor is a Zinc finger protein. -   11a. The host cell according to embodiment 10a, wherein said Zinc     finger protein binds a DNA element comprising the sequence of SEQ ID     NO 35. -   12a. The host cell according to any one of embodiments 1a-11a,     wherein said cell has been modified by random mutagenesis for     disruption of at least one endogenous gene encoding a protein     contaminant expressed endogenous by said host cell. -   13a. A method for producing a host cell according to any one of     embodiments 1a-12a, said method comprising the following steps in     any order: -   a) optionally transfecting said cell with a polynucleotide construct     encoding a Vitamin K-dependent protein of interest; and -   b) modifying said cell to express a substantially lower amount of at     least one protein contaminant expressed endogenous by said host cell     in the absence of said modification. -   14a. The method according to embodiment 13a, wherein said Vitamin     K-dependent protein of interest is selected from the group     consisting of GAS-6, Protein S, Factor II (Prothrombin), Factor X,     Factor IX, Protein C, Factor VII, Protein Z, Transmembrane     gamma-carboxyglutamic acid protein 1, Transmembrane     gamma-carboxyglutamic acid protein 2, Transmembrane gamma     carboxyglutamic acid protein 3, Transmembrane gamma-carboxyglutamic     acid protein 4, Bone Gla protein, Matrix Gla protein, and     Osteocalcin. -   15a. The method according to any one of embodiments 13a-14a, wherein     said protein contaminants is a second vitamin K-dependent protein. -   16a. The method according to any one of embodiments 13a-15a, wherein     said protein contaminant is Protein S. -   17a. The method according to any one of embodiments 13a-16a, wherein     the host cell is selected from the group consisting of CHO cells,     293 (HEK293) cells, BKH cells, HKB11 cells, SP2/0 cells, and NS0     cells. -   18a. The method according to any one of embodiments 13a-17a, wherein     said cell has been modified by transfection with at least one siRNA     polynucleotide construct targeting a mRNA encoding a protein     contaminant expressed endogenous by said host cell. -   19a. The method according to any one of embodiments 13a-18a, wherein     said cell has been modified by disruption by gene knock-out of at     least one endogenous gene encoding a protein contaminant expressed     endogenous by said host cell. -   20a. The method according to embodiment 19a, wherein the endogenous     gene encoding Protein S has been disrupted by gene knock-out of exon     1. -   21a. The method according to any one of embodiments 13a-20a, wherein     said cell has been modified by transfection with at least one     transcription factor binding to a DNA element of the gene encoding     said protein contaminant expressed endogenous by said host cell. -   22a. The method according to embodiment 21a, wherein said     transcription factor is a Zinc finger protein. -   23a. The method according to embodiment 22a, wherein said Zinc     finger protein binds a DNA element comprising the sequence of SEQ ID     NO 35. -   24a. The method according to any one of embodiments 13a-23a, wherein     said cell has been modified by random mutagenesis for disruption of     at least one endogenous gene encoding a protein contaminant     expressed endogenous by said host cell. -   25a. A method for producing a composition comprising a Vitamin     K-dependent protein of interest with a substantially lower amount of     at least one protein contaminant expressed endogenous by said host     cell in the absence of modification, said method comprising the     steps of: -   a) producing a host cell according to any one methods of embodiments     13a-24a; and -   b) growing said host cell in a growth medium and harvesting said     growth medium comprising said Vitamin K-dependent protein of     interest. -   26a. A composition produced by the method according to embodiment     25a. -   27a. A nucleic acid sequence comprising the sequence of SEQ ID NO 1. -   28a. A nucleic acid sequence comprising the sequence of SEQ ID NO 2. -   29a. An amino acid sequence comprising the sequence of SEQ ID NO 3.

EXAMPLES Example 1 Cloning of CHO Protein S cDNA

The Chinese Hamster Ovary, CHO, Protein S cDNA sequence was not known from any nucleotide or protein database but was expected to have high identity to the nucleotide sequence of Protein S from other rodents.

CHO Protein S PCR fragments were generated from CHO cDNA using primers designed from alignment between mouse and rat Protein S cDNA sequences or genomic sequences. The cDNA fragments were sequenced and assembled to form a full-length coding sequence for the CHO Protein S gene. The full-length CHO Protein S cDNA was cloned by PCR using the primers “CHO ProtS forward” and “CHO ProtS reverse” and CHO-K1 derived cDNA as template.

The predicted CHO Protein S amino acid sequence has 90.5% identity to mouse Protein S and 90.7% identity to rat Protein S.

CHO ProtS forward (SEQ ID NO 1): 5′-GCCCAGGCTCGCAGCTCCTCTGG-3′ CHO ProtS reverse (SEQ ID NO 2): 5′-CAGGTGACACCTGCCAGCTGGTG-3′ CHO Protein S cDNA sequence (SEQ ID NO 3): gcccaggctcgcagctcctctgggcggagcgccggctcggtccccgctgcgccagccgtgatccccggcagcctgctcagcaa tgagggtcctgagcgcgcgctgtcggctactgctggtatgcctagccctggtgctgccagcctcggagacaaactttttgtcaa aagaacatgcctcgcaagtcctggtgaggaagcgccgcgcaaataccttgcttgaagaaactaaaaagggcaatcttgaaa gagaatgcatcgaagagctctgcaataaagaggaagccagggaggtctttgaaaacaatcccgaaacggattatttttatcc aaaatatttgggttgtctgggcatgttccgtgctggcctgttcagtgctgcgcggcagtctgttaatgcttaccccgacctcagga gctgtgtcaatgccatcccagaccaatgtgatcctatgccatgcaatgaagatgggtatctgagctgcaaagatggccaagct gctttcacatgcatctgcaaaccaggatggcaaggggacaaatgccagtttgatgtaaatgaatgtaaagatcccttaaatgta aatgggggctgcagccagatttgtgacaacactcctggaagttaccactgctcctgcagaagtggctttgctatgctttcaaaca aaaaagactgcaaagatgtggatgaatgctctatgaagcccagtgtttgtggctcagctgtgtgcaagaacactccaggaga ctatgagtgtgaatgtcctgacggctacagatatgatccctcatcgaagtcttgcaaagatgtggacgaatgctctgagaacat gtgtgctcaattgtgtgtcaattaccctggaggctactcttgttactgtgatggaaagaaaggattcaagcttgcccaagatcag aagagttgtgagggtattccagtgtgccttcccttgaaccttgacaaaaattatgaattattgtacttggctgagcagtttgtagg agttgtcttatatctgaaatttcgtttgccagaaattaccagattttcagctgaatttgattttcggacatatgattcagagggcatc atcctgtatgcagaatctcttgatcactcaaattggctcctgattgcacttcgtgatggaaaaattgaagttcagtttaagaatga gttttcaacccaaatcacaaccggaggcaatgttattaacaatggtaaatggaacatggtatccgtggaagaattagacgaca gtgttagcattaaaatagctaaagaagctgtgatgaatataaataaatttgggagcctctttaaacctacagatggatttctgga caccaaaatatactttgcaggattacctcgggtagtggaaagtgcactcattaaaccgattaaccctcgtctggatggatgtata cgaggctggaacttgatgaaacaaggagctttaggtgcaaaggaaattattcaaggaaaacaaaataagcattgcttcctcat ggtggagaagggctcctactaccctggttctggaattgctcggttcagcatagattacaataatgtaaccaatgcagagggctg gcaaataaatgtgaccttgaatattcgtccatccactggcactggaattatgcttgccttggtttctggagacaaagtgccctttg ccttgtccttggtgggctccagctctgaaaattctcaggatattgtggtatttgttgaaaattcagtggtggctcgaatggaggcc ataactctgtgttctgaccagcaatcccaactgaaatgtaatgttaacagacatggcctagagctatggagcccactgaagaa agatgtcatctactctaaagatattcaaggacaactagcagtcttggacaaagcaatgaaaggaaacgtggccacttatctgg gtggcattccagatctttccttcagtgccacgccagtgaatgccttctacagtggctgcatggaagtgaacatcaacggggtgc agttggatctggatgaagccatttctaaacataatgacatcagagctcactcatgtccttcagttaagaaaatccagaagaacg tctaatgtctgttttctgtgcttataatgcccctttccttgtaattatgctcacgcccctatcaccagctggcaggtgtcacctgtgaa gtgcaatgtttgaaatgatgtggtactttgtccttcagatttttgttatataaaccacgttttttttttttttttaaagtctttcttctattg ctgtctagaaattaaataa CHO Protein S coding sequence (SEQ ID NO 4): atgagggtcctgagcgcgcgctgtcggctactgctggtatgcctagccctggtgctgccagcctcggagacaaactttttgtca aaagaacatgcctcgcaagtcctggtgaggaagcgccgcgcaaataccttgcttgaagaaactaaaaagggcaatcttgaa agagaatgcatcgaagagctctgcaataaagaggaagccagggaggtctttgaaaacaatcccgaaacggattatttttatc caaaatatttgggttgtctgggcatgttccgtgctggcctgttcagtgctgcgcggcagtctgttaatgcttaccccgacctcagg agctgtgtcaatgccatcccagaccaatgtgatcctatgccatgcaatgaagatgggtatctgagctgcaaagatggccaagc tgctttcacatgcatctgcaaaccaggatggcaaggggacaaatgccagtttgatgtaaatgaatgtaaagatcccttaaatgt aaatgggggctgcagccagatttgtgacaacactcctggaagttaccactgctcctgcagaagtggctttgctatgctttcaaac aaaaaagactgcaaagatgtggatgaatgctctatgaagcccagtgtttgtggctcagctgtgtgcaagaacactccaggag actatgagtgtgaatgtcctgacggctacagatatgatccctcatcgaagtcttgcaaagatgtggacgaatgctctgagaaca tgtgtgctcaattgtgtgtcaattaccctggaggctactcttgttactgtgatggaaagaaaggattcaagcttgcccaagatca gaagagttgtgagggtattccagtgtgccttcccttgaaccttgacaaaaattatgaattattgtacttggctgagcagtttgtag gagttgtcttatatctgaaatttcgtttgccagaaattaccagattttcagctgaatttgattttcggacatatgattcagagggca tcatcctgtatgcagaatctcttgatcactcaaattggctcctgattgcacttcgtgatggaaaaattgaagttcagtttaagaat gagttttcaacccaaatcacaaccggaggcaatgttattaacaatggtaaatggaacatggtatccgtggaagaattagacga cagtgttagcattaaaatagctaaagaagctgtgatgaatataaataaatttgggagcctctttaaacctacagatggatttctg gacaccaaaatatactttgcaggattacctcgggtagtggaaagtgcactcattaaaccgattaaccctcgtctggatggatgt atacgaggctggaacttgatgaaacaaggagctttaggtgcaaaggaaattattcaaggaaaacaaaataagcattgcttcc tcatggtggagaagggctcctactaccctggttctggaattgctcggttcagcatagattacaataatgtaaccaatgcagagg gctggcaaataaatgtgaccttgaatattcgtccatccactggcactggaattatgcttgccttggtttctggagacaaagtgccc tttgccttgtccttggtgggctccagctctgaaaattctcaggatattgtggtatttgttgaaaattcagtggtggctcgaatggag gccataactctgtgttctgaccagcaatcccaactgaaatgtaatgttaacagacatggcctagagctatggagcccactgaag aaagatgtcatctactctaaagatattcaaggacaactagcagtcttggacaaagcaatgaaaggaaacgtggccacttatct gggtggcattccagatctttccttcagtgccacgccagtgaatgccttctacagtggctgcatggaagtgaacatcaacggggt gcagttggatctggatgaagccatttctaaacataatgacatcagagctcactcatgtccttcagttaagaaaatccagaagaa cgtctaa CHO Protein S amino acid sequence (SEQ ID NO 5): mrvlsarcrlllvclalvlpasetnflskehasqvlvrkrrantlleetkkgnlerecieelcnkeearevfennpetdyfypkylgcl gmfraglfsaarqsvnaypdlrscvnaipdqcdpmpcnedgylsckdgqaaftcickpgwqgdkcqfdvneckdplnvng gcsqicdntpgsyhcscrsgfamlsnkkdckdvdecsmkpsvcgsavckntpgdyececpdgyrydpssksckdvdecs enmcaqlcvnypggyscycdgkkgfklaqdqkscegipvclplnldknyellylaeqfvgvvlylkfrlpeitrfsaefdfrtyds egiilyaesldhsnwllialrdgkievqfknefstqittggnvinngkwnmvsveelddsvsikiakeavmninkfgslfkptdgf ldtkiyfaglprvvesalikpinprldgcirgwnlmkqgalgakeiiqgkqnkhcflmvekgsyypgsgiarfsidynnvtnaeg wqinvtlnirpstgtgimlalvsgdkvpfalslvgsssensqdivvfvensvvarmeaitlcsdqqsqlkcnvnrhglelwsplk kdviyskdiqgqlavldkamkgnvatylggipdlsfsatpvnafysgcmevningvqldldeaiskhndirahscpsvkkiqk nv

Example 2 CHO Protein S mRNA Degradation by Use of Small Interfering RNA

The mRNA of Protein S in Chinese Hamster Ovary (CHO) cells can be degraded by the introduction of small interfering RNA, siRNA, into the cells. siRNA is a short double-stranded RNA molecule that may separate inside the cell and the antisense part of the molecule may hybridize to a complementary mRNA and induce cleavage of this mRNA by a process in which the Dicer nuclease plays a key role. The effect of siRNA is described in Elbashir-S M et al., Nature 411 (2001) 494-498. siRNA may be synthesized as single-stranded RNA and subsequently annealed to form the double-stranded siRNA molecule. The siRNA molecule may subsequently be transiently transfected into cells and exert its function. Alternatively, siRNA may be expressed as a hairpin molecule under regulation of a Polymerase III promoter as described in Brummelkamp T R; Bernards R; Agami R, Science 296 (2002) 550-553.

A vector that permits the transcription of each of two complementary strands by individual promoters was developed in our laboratory. The vector is called RansiRNA because random DNA can be inserted into it and both strands of the insert can be transcribed. The RansiRNA vector contains the human H1 polymerase III promoter and the mouse U6 polymerase III promoter. The two promoters are pointed towards the siRNA template from each direction, transcribing the sense and antisense strand of the siRNA molecule, respectively. The vector also harbors the hygromycin drug resistance gene. The RansiRNA vector is similar, but not identical, to the pHippy vector described by Kaykas & Moon (Kaykas-A & Moon-R T, BMC Cell Biology Vol. 5 (1) pp. 16 (2004).

Several target siRNA sequences were selected from the CHO Protein S coding sequence, only targets containing the sequence AGN₁₇CT (SEQ ID NO 6) were chosen. Each target sequence were purchased as two complementary DNA oligonucleotides extended with A's 5′ to the target and T's 3′ the target which serve as termination signal when transcribed in reverse and forward direction. The oligonucleotides also harbor a four base-pair 5′-overhang which is compatible with the Bgl II restriction site (GATC). The annealed oligonucleotides are cloned into the BglII-site of the RansiRNA-hygro vector.

The specified siRNA constructs were stably transfected into a CHO K1 cell line expressing a human FVII analogue. Cells were plated in 6-well plates at density of 2×10⁵ c/well in complete medium (DMEM medium containing 10% FBS, non-essential amino acids and vitamin K). After two days the cells were transfected at 90% confluency. Transfection using Lipofectamine-2000 (Invitrogen) was performed according to recommendations from the manufacturer. After 48 hours the cells were transferred to selection medium, which was composed of complete medium additionally supplemented with 300 ug/ml hygromycin. After selection for 14 days, cells were cloned by limiting dilution. After clones had grown up the FBS containing complete medium was changed to serum free medium (PF CHO supplemented with vitamin K, Hyclone) in order to avoid detection of bovine protein S in the following ELISA. The supernatant from approximately 100 clones for each siRNA construct were screened by protein S ELISA using human protein S as standard. The same supernatant was also screened by a human FVII ELISA. The clones that had the lowest expression of protein S and that had not lost the expression of FVII were further characterized. Clones that had down regulated the expression of protein S to the level of only 10% of the expression level exhibited by the parental CHO K1 FVII expressing cell line were isolated.

siRNA Target Sequences

#821 siRNA1 target (SEQ ID NO 7): 5′-agtgtgaatgtcctgacggct-3′ #821 siRNA1 upper oligo (SEQ ID NO 8): 5′-gatctaaaaaagtgtgaatgtcctgacggctttttta-3′ #821 siRNA1 lower oligo (SEQ ID NO 9): 5′-gatctaaaaaagccgtcaggacattcacactttttta-3′ #822 siRNA2 target (SEQ ID NO 10): 5′-agctgcaaagatggccaagct-3′ #822 siRNA2 upper oligo (SEQ ID NO 11): 5′-gatctaaaaaagctgcaaagatggccaagctttttta-3′ #822 siRNA2 lower oligo (SEQ ID NO 12): 5′-gatctaaaaaagcttggccatctttgcagctttttta-3′ #835 siRNA4 target (SEQ ID NO 13): 5′-agaacatgcctcgcaagtcct-3′ #835 siRNA4 upper oligo (SEQ ID NO 14): 5′-gatctaaaaaagaacatgcctcgcaagtcctttttta-3′ #835 siRNA4 lower oligo (SEQ ID NO 15): 5′-gatctaaaaaaggacttgcgaggcatgttctttttta-3′ #836 siRNA5 target (SEQ ID NO 16): 5′-agaaactaaaaagggcaatct-3′ #836 siRNA5 upper oligo (SEQ ID NO 17): 5′-gatctaaaaaagaaactaaaaagggcaatctttttta-3′ #836 siRNA5 lower oligo (SEQ ID NO 18): 5′-gatctaaaaaagattgccctttttagtttctttttta-3′ #837 siRNA6 target (SEQ ID NO 19): 5′-agccagatttgtgacaacact-3′ #837 siRNA6 upper oligo (SEQ ID NO 20): 5′-gatctaaaaaagccagatttgtgacaacactttttta-3′ #837 siRNA6 lower oligo (SEQ ID NO 21): 5′-gatctaaaaaagtgttgtcacaaatctggctttttta-3′

Example 3 Gene Targeting of CHO Protein S

A definitive way of abolishing protein expression of Protein S is to disrupt the gene. The technique of “gene targeting” or “gene knock-out” in mice has been known for many years. Gene targeting in cultured cells is also well established, and an example of a CHO knock-out cell was recently described in Yamane-Ohnukiet et al. Biotechnology and Bioengineering, 87(5): 614-622, 2004.

We predicted the exon structure of the CHO Protein S gene by an alignment of the CHO Protein S cDNA to the human gene. Primers designed to bind in exon 1 and exon 2 of CHO Protein S was used in a Polymerase Chain Reaction, PCR, the template was CHO genomic DNA. The amplified 4.4 kb product was sequenced. Primers binding exon 2 and exon 3 was used to PCR amplify intron 2. An amplified fragment of 3.5 kb harboring intron 2 was cloned and sequenced.

The regions upstream of exon 1 from mouse and rat Protein S were aligned and sequence stretches with high identity were used to design oligonucleotide primers for use in PCR. A 1650 bp 5′UT/promoter band was cloned and sequenced. The gene targeting construct will combine the “1.6 kb 5′ UT/promoter fragment”, “Plox-PGK-hygromycin resistance gene-Plox”, “intron 1”, “exon 2” and “PGK-TK”. This construct is omitting the coding sequence from exon 1 in CHO Protein S, encoding the amino acids MRVLSVRCRLLLVCLALVLPASETN (SEQ ID NO 22). A second construct containing the blasticidin drug resistance gene can be made in a similar way.

The “hygromycin”-gene targeting construct can electroporated into CHO cells and cells plated in dishes. The next days the cells are exposed to 600 microg/ml hygromycin and 1 micromolar ganciclovir. The clones are now selected for hygromycin resistance gene and against herpes simplex thymidine kinase gene. After colonies appeared they will be transferred to 96 wells plates. The cells grow to confluence and duplicates of the plates will be made. Genomic DNA is harvested from the cell clones and PCR-reactions using a hygromycin resistance gene specific primer and a primer 3′ the promoter present in the construct are performed. Clones with a positive PCR-band are grown in flasks and a Southern blot will be made to verify the PCR result. Second, the targeting construct harboring the blasticidin resistance gene are electroporated into the hemizygous CHO cells whereafter the cells are selected for the blasticidin resistance gene and against thymidine kinase gene using 10 microg/ml blasticidin and 1 micro-molar ganciclovir. Again, cell clones are PCR verified using a blasticidin resistance gene specific primer and a primer 3′ to the promoter present in the construct. Positive clones are again tested by Southern blots. Homozygous disruptants are transfected with a Cre recombinase expressing plasmid. Cre recombinase will recombine the lox sites and remove the drug-resistance genes.

Intron 1 primers: CHO-protein-S-exon1-forw2 (SEQ ID NO 23): 5′-CTGCTGGTATGCCTAGCCCTGGTG-3′ CHO-protein-S-exon2-rev2 (SEQ ID NO 24): 5′-TGCAGAGCTCTTCGATGCATTCTC-3′ Intron 2 primers: CHO-protein-S-exon2-forw2 (SEQ ID NO 25): 5′-AAGGGCAATCTTGAAAGAGAATGC-3′ CHO-protein-S-exon3rev (SEQ ID NO 26): 5′-CCAAATATTTTGGATAAAAATAATC-3′ 5′UT/promoter primers: PS-CHO promoter f2 (SEQ ID NO 27): 5′-AARCAACCCCTTTTGACCAT-3′ CHO-protein-S.promoterRev1 (SEQ ID NO 28): 5′-CCCAGAGGAGCTGCGAGCCTG-3′ 5′UT/promoter (immediately 5′ to coding sequence) (SEQ ID NO 29): aarcaaccccttttgaccatacacatttctactctttgtgtttgctggagctgttttctccccacactcaaccccctttgctgaagcct ggaacttgctttccacagcttaagttgttataggtttcaatcatctgtccacctccctgactttcataattttgtgaaatacccttgca tatatatatgggactaaatattattttctcctggttgtccataatagattaatttaattcctaaacaaagaacagaacatagattgg tatagtagaagagtttcccttctccctactgcatgaatggaaattccccaaaccatccttatcagagaaattaactcacatactag tcacctttcattcagctggatgacaaaatcattttaaaaaaagagaataaagaaaacagataagaacaactagatctaggaat aatacttaaaatatgattctgcttagtaggtttcattcacacacctagaaaaaaaaatcagtcaatgtttcctttgggcagaaaat gagcaataatgggtatgcattgaccactactgttggacatagccttattgcttcatatagcatctattcaaagtctcagatcaaca ctatgaaaacctgtcatctctgtattagatgatgtgactggggctgtaaagggtaagctcttttcttacagctatacaacaacgct aagaccaagttctgtgctttgagcccaggcagtttagtttcccaggagcaacctaaagcctgattcacaggcatatgtatgatcc aaactgaatggtagtacatcaataccaaaacaatctattggtggaaacacaccataggtgatcgaaatactccattttcttttcct ctcatgacttctgttctgagcagtcctcttcctaaagtctacattgtcttctgagttcaggctgacatcttgacatcctcctggctgg cacagtctctggacaaggagggaagaaggagagaaggggaaagggagaggagggggggagggagagaaagaatggg aagaggaaggatatgaaagagagaagagaggagggaaggcgggaggaagggagggagggagggagggagagagg gagagagaggagagagagagagagagagagagagagagagagagagagagagagagagagagagagggagaggg agagagagacagagagagagagagggagagggagagagagagagagagagagagagagagagagagagagagag agtgaggagagagagagagagttttcttcaccattggacattcctaaagaaaagaagtaaatgcaggattggggacagtga cagaggacctctgataaactttctgaggcctctgacctcactctctcggagccctcctccaccacccaccccccccctccctagct gagaaaagcttccaggaaatgtcccagtcatcgcttcccctcccgggctgggggctgggagcgggcggtcccctcaggccag ggctgctccggccgcgctcgggcagggccacaacagagctgggaaagctgagcccaggctcgcagctcctctgggcggagc gccggctcggtccccgctgcgccagccgtgatccccggcagcctgctcagca exon1 (SEQ ID NO 30): atgagggtcctgagcgtacgctgtcggctactgctggtatgcctagccctggtgctgccagcctcggagacaaac intron1 (SEQ ID NO 31): tgtaagtaatccatacctcctggcttctccattccctatgtgccccggcttgaagattttccactaggctgtttgctgcctcctaagtt tccagtaagtccgccaccattcagagagtcgcggcagcctgggtctggtgggcagtgtaaaggtgggacaggatcaaagctt gccttgctttgagaaccattgtccacaggacttgattccagaacccgggtgacactaagtgtcaaaggaattgcttgaacatag tcctaaatattgctaggaaagctaagtcaagcctgttgccctcctcccgtttacaagagtgccccagcccgcaccctctcctgcg gctaaccttccttttgcaatttctggactttgaacttgattgactggtctcacattgacaaactgtttggggactgctggggtgttac atatgattctctaaccttgatataagaaatagctgttggatgttaccttgtaccgaggatcattttctgagggttttgactgttgcc gctttgagatggcagcaagaattctgtacaacacacacatttttgtgtttcttggtctttcctcttcccattctcagattccgggcag tatatcgagttttctcttagaaatataaaacgaaccacaaggttttagtacattttaatggtcaattaaattgtttttagaagcttaa atatgttcataattaacactgctttcttttgctcttttgtagtcccagtcactggcatgggagcaataactgtataacaaataccact taggtcactgcgagcaccaaagaaacttttcaaagatggtaattaagtaggagtttgctggaattgcaagtttttattaattagt aaggaatctagcctgatatttttaaatgtctaactaagttaaagaccagaatgaaactggttcactttttattgaggataaacaa gttacagttataaagcctcaacaatcaaagccctacgatgaagcagcgtgtgactgtatgcacatgatctatcttgttcagagg aacaatcaaacattttcagatagcatcagggcggtggtggtactcgcctataatcctagcaaagtcagaggcaagcagatctc tgtgttcaaggccagcctagtctacagagtgagttccaggacaactggggctacacagagaaacctgtctcagagaaaaaca aaataaaaccaaattcagatagctggtgtttgggaaaagagcaaaagacagcagtgctggccacacagagagtagacaag ttcattctacaaggacatcacagaaagaatatgtgacccaatgacgaccataaactttcttgttcctgtgtcaaattatctccggt ttattgatgaagaaccagacactatgagctgcgtctcctccttaagattttgttttggtgtcttgtttttgtcaaggggtttcattgtg gccctgagcattagatccagggctttgtgcatgctaggccagggagctatattcccgaactccagaagactaggaatttgagat ataaatagaatttgaattaccttctgtacaattgattgtatggttctagaaatattgctatattaagggaagcctttgcagaagac agttattttgagatggtgcataacacaaaagaaatgaactaaagcctgaggcctgctctgtagctctgccttgcccttagcctac aataactttctttacctttcaagcatgtgccaccacgcctgactttcaggcccttcattttaacaagaaagcaagtattcagttatc aactgactttccaaatgcatttgtatgaataaaaactacaaaaatataaaaataagaactatacacacaaaagccttgtattta aaatttacgctgtggacatattttgctcatcattcgtgagagcttgcggtaaaaaggcaaaggggaagaggaggatatctattt tgggtaggctaatttggccttatccagacttcccttttgggtggatgcagtctgcccagcacactattggcccatttcttctacatg gctttgtgctctgctctgcccttagctaattgtcccctttgacatgcttttgtctttccttaaagtttctatacttcaaaaaccatcccgc tacactaatggagtgattttctcaagggttgctttatgtttggggtttgtactgcaagagttagtttctgatatagcaatggtgata gtatagtcttctaccatgaactctatgccagcaagtacaggggtatatttcacatgggtgttttctgttcactgagtttcatgtcttc tttgtatctttttgttttgttttgtgagacagggtttctctgtagcttttgagtcagtcctggaacttgctggccggccttgaactcaca gagattcacctgcctctgcctcccaagtgctgggatttaaggtgtgagtcaccactgccaggttttttctttgtatcttgagtgaac taaataggtaagctttaaataataatatgagcagtctatttatatacattaaatattaaatgcattgtgagatgagcatagccttt gaggcccaggaacagaaagatttacttcacattgtaaatatactggtatacatacaaacgtacatacnnnnnngtgtgtgtgt gtgtgtgtgtgtgtgtgtgcatgccatagcacacatgtgaagtccagagtacagcattctctttttctacctttctgtagattcttgt ggtcagagtcaggtcaaatcaaatcagacagatgcatgtataaaatgctcttacccactgaaccatcttgctgcttggtccaca agcttagtggaagaatgctgggaagtgaatagtatgtttttaaatgtagttaaccttgactttttgttgttgttgctgttattgagg ccacattttcattgttctgagaaaatattactattttcctcagacagaattatatatttatttgaagttcatgaattccatattattttc ctgtatttattacaaatagcatgcttaaacacttccaagtagtgaaacagctgctcatgtaggacacggattattgacagtgctg ccatttatcagccagtaatccacttggcaggtagcacgctcatcgttatcctttatgcacacaaagccttgtttgaattttatctttt aatgagtgtcaatgaaatggaaagagataagagttaaaaatacaacccaaactattgtatttacatttctcttttagaagaaac ctaaagcagcattacttcttgcccatatttaataaataacatcatttacccttgttccctgcctccagactctcccatatactcctcttt caattttattggcccctttaaatgacatatcattacatgtatatccctacacataagtataaccagttcagtttgtataatgttacttg catgtgtgttttcaatgctgatcatttggtagtggataaccaatggtgtgccctatgaaggggcagagtatttgtatcatgcttag cattcctttgttgactgtaggattttgtttaaggttgaggtctcttggtctttcccctgtctgcttctgcatgtccatggccatccttgtt cagctcatgtttatgtagtcatgctgatgaggctttatggatgtagcttctgacattgctaagcaacacagtctcagcaaactccc cagtcctctggttcttacaatctttccacactgtttcaccatgttgtctgagccttaggtgctgaagttgttttgtgtctgtatccattg ggactaggctccacatgtctgcattttgattacttgtggttttctgtaacggtctctatgtgttgcaacgagaaggagtagttgctt tgacgatgtgtaaagactatcttgtgggtataaggacaaatatttgcatgaagctatggattatgctggtctcaagcatgaactg gataaattgtacagctcacacaaaacagctatagctagctgcacagtcaggcatgcactgatctgcttggggagttgttaacca aagggcttacatagctatgtattttctaagctctagttttactatcacaaagaaaattaattcacccttaattgtttaataagatgat atatcttagggaaaaaatgaaggtctttttttgacttatataaaagcttatgttttctacagttt exon2 (SEQ ID NO 32): tgtcaaaagaacatgcctcgcaagtcctggtgaggaagcgccgcgcaaataccttgcttgaagaaactaaaaagggcaatct tgaaagagaatgcatcgaagagctctgcaataaagaggaagccagggaggtctttgaaaacaatcccgaaacg intron2 (SEQ ID NO 33): gtaagagttcgtggaaatgaccaagtccacactcggatatatattggcagtcagaacactgccagcttgagctaccttgcttct gtttgaaagctaatgacttaggagttcatttctcatgtgttaccactgacatttcaggcaggctgccaatgacaggcactccagc caaactccatttcccttaagtctcattactcgcaactagtatcgactttataatgtgtgactattttattatcctaaccaaatctggta gccttgagggtgcaagagaagatgcgactgaagggtaagtgaccatatatgtacttgcattgtcactgtgcttttgttttggttg attgtgtttgagacagtctcttactctgtagctccaactacaaggagctccctatccatctgctttggcttcagcctcccaagtact gtgattatagactggtgtgtcttgccatttatctttaagaggctctagatagaaatggggccacctaactgagattagtcattaca gcattatgtatgctgactgtatactattctgtaaccttcatgaagtttcccgaggccactgataatcagcagtaatcattagtgtct aaaaatttccaagttacccacccgccaaacataacataaagacagcaacatgggactctttgtccattctgtgtttcaggagag ggcaatttatagtatgcttgtaactaacaggagtagcattaatatctccaaggagcactttgagcatgaccttgagagtctacat ggaacactgttcagggtctcctcagatgttctacctgagctgaattatacaatctggaggaaaagaaagagatgacatacaca aggctcctcctttgcctctgccacagctcccagaaccatgacaacagctgagtgataaagagcaaggactctttgtccatactta gaaaatttgtccccaactgtagctacttgtggtctgtggttgttattgtagctcttttttaatccctatgtgttctgataggttcaaag aagaaattttccccaaatatgcaacaattaaattttaatctacctagaattgagacaaaaatgtgacgaaataccttgatcaaa aaaacaactcaggaggaaagggtttttttttttttttttggtttactaacctgaattgagggaagcaaaagtaggagctcaaacc aggtgggaacctggaggcaggagctgatgcagaggcatggaggagtgctgcttactggtctgctcctcatggcttgctcagct tgttttcttatagaacccagggccaccgtcacaaaagtaccatcacctgcaatgggttgggcccttccccagggatctctgatta agaaaattccctacaggtctgtctacaattcttttttgtttgtttgtttgtttgtttgtttgtttgttttcgagacagggtttgtctgtata gctttggagcctgtcctggaactcactctgtagaccaggttggcctcgaagtcacaaagatccacctgcctttgcctccctagtgc tgggattaaaggcttgtgtcaccactgccaggcctattttaaggaagcatttttctccttgagattccttcctctcaaatgattctag cttgtatcaagttgacataaaattagccagcacagacaacaacaatagaaaattttctatcctacacaatgtaataaatttattg ggtaggatttaacatatgtattctatgttttacattctcattctaaaaaggaatgtgtatgcactcttacaaacttccataatacaa aagaatacagtatgtattagatatgtgcatatattccttccctttatggaaagtttaaaaagtagaaagaatggtataataaact gcaacacaacacgtccctctaataagatcaaggctttcatttgattttgcctatccaccacatctaatcaatggttttgctttgagc aatcaagtcacatgattatattacccatacttgagttgtatatctgcattgtagatatgttctcaaagctcagcctttaaagagtag tagggagggaagatggaccacaggaagaagggggaggaaggtgaagaagaaaacacattcgtgtttctttaccttcacta atagttttgttgacagattccacctactccctgtccatatccctcatactcttaggccagtattcccagtgttattgaccctgatgttt acctgttcgcttgtcatcagcatgtcaccaatctttaaatgccattgtttgtctccttattgtcttgtctctgcttctgcagtaaacaac actgttgtctgaatgagtcagtgtcaggcccctttcttataagccagtagaaacgtgcaagtttgtacatgataagaggaaaga gtgtagattttgatgtagaaaaagccaagctccactctaagccagaattttgaatactttttatgcagaaattttgtttttgtatga aatattcttgtgttatttatttacattatgagtgtactgtcagaagctcataaaaattaccctgttcataaaatacattccttcatcca tatgtcatcattattttgctatccatcaatatataaggaaggtgtttcacatgcattagatgcaataaggtaagtggtcattttagtt ctctttaaatgatttcattgttgactccagtgtagatagtcatcatggcataagatgtatcaaatgaagactaggtgtggtggtgc ataccttcagtcccagcacacagaggcagaggaacatggattgctgtgagtttcaggtggacctggtctacatagtgagttcca aggtagatagagggtgtctcgagagaccctgtaagaaaagtctatgtttaattgccatgaaaaaattagaggattataaaaga gggaatatattgttatagttatcaactacaaccagttcaaatcagaagctttaaaatgttattttattgttcagtagtgttttaagca tatatatgtatacacacaaacatatatgtgtttatatatatgtatatgtatactggtcaagtattggctatctattcttgaagtattta tagaaaaattagaaatgtgaaaacatacaacatgtaggtcatttccatattcatataaaagcaaattagaaaaattaatcttta actctgtagtgatatttgagtttgctaatatctatttttttattttctttctag exon3 (SEQ ID NO 34): gattatttttatccaaaatatttgg

Example 4 Transcription Factor Engineering

Expression of Protein S may be reduced or abolished by transcriptional down regulation of Protein S mRNA. Transcription factors can be designed to bind specific DNA elements in the promoter region of the CHO Protein S gene. Zinc finger proteins are ideal for such a manipulation and common procedures are reviewed by Wolfe-S A et al. Annu. Rev. Biophys. Struct. vol 3:183-212, 1999 and Jamieson-A C et al. Nature Reviews, vol 2:361-368, 2003. Typically a single zinc finger binds three bases adjacent to each other on the same DNA strand and a forth base on the complementary strand. Thus, several zinc fingers can be combined in order to bind a desired DNA element. Recognition of a DNA element of 15-18 base pairs, which actually can be universal in the genome, needs a combination of 5-6 zinc fingers.

A DNA element having the sequence GGAGAGGAGGGGGGG (SEQ ID NO 35) from the CHO Protein S promoter are chosen and Zinc finger proteins binding the DNA element is predicted based on the publications by Liu-P Q et al., Journal of Biological Chemistry, Vol. 276 (14), pp. 11323-11334, 2001 and Zhang-L et al. Journal of Biological Chemistry, Vol. 275 (43), pp. 33850-33860, 2000. A synthetic five zinc finger protein based on SP1 and BTEB4 is made by PCR from overlapping oligonucleotides as described in Zhang-L et al. Journal of Biological Chemistry, Vol. 275 (43), pp. 33850-33860, 2000: Zinc finger 5 CXXCXXXXXQSGHLQRHXXXH (SEQ ID NO 36) interacts with GGAg; zinc finger 4 CXXXXCXXXXXRSDNLARHXXXH (SEQ ID NO 37) interacts with GAGg; zinc finger 3 CXXCXXXXXRSDNLTRHXXXH (SEQ ID NO 38) interacts with GAGg; zinc finger 2 CXXXXCXXXXXRSDHLTRHXXXH (SEQ ID NO 39) interacts with GGGg; zinc finger 1 CXXXXCXXXXXRSDHLARHXXXH (SEQ ID NO 40) interacts with GGGa; and N-terminal to the zinc fingers the KRAB domain of KOX1 is inserted.

Upon binding of the engineered zinc finger protein to the GGAGAGGAGGGGGGG (SEQ ID NO 41) DNA element the CHO Protein S transcription was expected to be down-regulated.

The CHO Protein S promoter region (SEQ ID NO 29) was cloned into pGL3-basic (Promega, Madison) and was used as reporter construct in a luciferase reporter assay to determine the effect of ZNF-PS. The plasmid encoding the ZNF-PS gene and the CHO Protein S reporter plasmid were transfected into CHO-K1 cells and luciferase activity was determined. ZNF-PS can downregulate Protein S transcription 50% in a dose-response independent manner. FIG. 3 a illustrates ZNF-PS downregulation of Protein S. In a similar experiment the CHO-K1 cells were transfected with ZNF-PS and pEGFP (Enhanced Green Flourescent Protein) and Protein S and pEGFP mRNA were determined by real-time PCR. pEGFP served as transfection control. FIG. 3 b shows a downregulation of Protein S by 50%.

ZNF-PS (SEQ ID NO 42) Mdaksltawsrtlvtfkdvfvdftreewklldtaqqivyrnvmlenyknl vslgyqltkpdvilrlekgeepwlvereihqethpdsetafeikssvssr sifkdkqscdikmegmarndlwylsleevwkpgkkkqhichiqgcgkvyg rsdhlarhlrwhtgerpfmctwsycgkrftrsdhltrhkrthtgekkfac pecpkrfmrsdnltrhikthtgerpfacdwqgcdkkfarsdnlarhhrth tgekrfscplcskrftqsghlqrharrhpgfhpdllrrpgarstspsdsl pcslagspapspapspapagl

Example 5 Determination of Numbers of Protein S Alleles in the CHO K1 Genome

The CHO-K1 cell line has only 21 chromosomes, compared to the Chinese Hamster which has 22 chromosomes, and only 8 of these 21 are normal. In the 13 altered chomosomes translocations, deletions, and pericentric inversions have been detected (Deaven & Petersen, Chromosoma 1973; 41(2), 129-144). It is not known whether the Protein S gene is present on normal or altered chromosomes or how many alleles are present in the CHO-K1 genome.

The SeeDNA Biotech Inc. company performed a FISH (Flourescence In Situ Hybridization) analysis on the genome of CHO K1 cells (ATCC# CCL-61) using a plasmid containing the Protein S Intron 1 probe (SEQ ID NO 43) in pCR2.1 (Invitrogen, Carlsbad) cloned and supplied by us. The results of the FISH analysis is shown i FIG. 4. The Protein S gene is localized onto two different chromosomes in the same metaphase. The chromosome with locus A (shown by an arrow) is submetacentric and of smaller size. The chromosome with locus B (shown by an arrowhead) is metacentric and of bigger size. The banding pattern of these two chromosomes is also different.

The Protein S Intron 1 probe (SEQ ID NO 43) ctgctggtatgcctagccctggtgctgccagcctcggagacaaactgtaagtaatccatacctcctggcttctccattccctatgt gccccggcttgaagattttccactaggctgtttgctgcctcctaagtttccagtaagtccgccaccattcagagagtcgcggcag cctgggtctggtgggcagtgtaaaggtgggacaggatcaaagcttgccttgctttgagaaccattgtccacaggacttgattcc agaacccgggtgacactaagtgtcaaaggaattgcttgaacatagtcctaaatattgctaggaaagctaagtcaagcctgttg ccctcctcccgtttacaagagtgccccagcccgcaccctctcctgcggctaaccttccttttgcaatttctggactttgaacttgatt gactggtctcacattgacaaactgtttggggactgctggggtgttacatatgattctctaaccttgatataagaaatagctgttgg atgttaccttgtaccgaggatcattttctgagggttttgactgttgccgctttgagatggcagcaagaattctgtacaacacacac atttttgtgtttcttggtctttcctcttcccattctcagattccgggcagtatatcgagttttctcttagaaatataaaacgaaccaca aggttttagtacattttaatggtcaattaaattgtttttagaagcttaaatatgttcataattaacactgctttcttttgctcttttgtag tcccagtcactggcatgggagcaataactgtataacaaataccacttaggtcactgcgagcaccaaagaaacttttcaaagat ggtaattaagtaggagtttgctggaattgcaagtttttattaattagtaaggaatctagcctgatatttttaaatgtctaactaagt taaagaccagaatgaaactggttcactttttattgaggataaacaagttacagttataaagcctcaacaatcaaagccctacga tgaagcagcgtgtgactgtatgcacatgatctatcttgttcagaggaacaatcaaacattttcagatagcatcagggcggtggt ggtactcgcctataatcctagcaaagtcagaggcaagcagatctctgtgttcaaggccagcctagtctacagagtgagttccag gacaactggggctacacagagaaacctgtctcagagaaaaacaaaataaaaccaaattcagatagctggtgtttgggaaaa gagcaaaagacagcagtgctggccacacagagagtagacaagttcattctacaaggacatcacagaaagaatatgtgaccc aatgacgaccataaactttcttgttcctgtgtcaaattatctccggtttattgatgaagaaccagacactatgagctgcgtctcctc cttaagattttgttttggtgtcttgtttttgtcaaggggtttcattgtggccctgagcattagatccagggctttgtgcatgctaggc cagggagctatattcccgaactccagaagactaggaatttgagatataaatagaatttgaattaccttctgtacaattgattgta tggttctagaaatattgctatattaagggaagcctttgcagaagacagttattttgagatggtgcataacacaaaagaaatgaa ctaaagcctgaggcctgctctgtagctctgccttgcccttagcctacaataactttctttacctttcaagcatgtgccaccacgcct gactttcaggcccttcattttaacaagaaagcaagtattcagttatcaactgactttccaaatgcatttgtatgaataaaaactac aaaaatataaaaataagaactatacacacaaaagccttgtatttaaaatttacgctgtggacatattttgctcatcattcgtgag agcttgcggtaaaaaggcaaaggggaagaggaggatatctattttgggtaggctaatttggccttatccagacttcccttttgg gtggatgcagtctgcccagcacactattggcccatttcttctacatggctttgtgctctgctctgcccttagctaattgtcccctttga catgcttttgtctttccttaaagtttctatacttcaaaaaccatcccgctacactaatggagtgattttctcaagggttgctttatgttt ggggtttgtactgcaagagttagtttctgatatagcaatggtgatagtatagtcttctaccatgaactctatgccagcaagtaca ggggtatatttcacatgggtgttttctgttcactgagtttcatgtcttctttgtatctttttgttttgttttgtgagacagggtttctctgt agcttttgagtcagtcctggaacttgctggccggccttgaactcacagagattcacctgcctctgcctcccaagtgctgggattta aggtgtgagtcaccactgccaggttttttctttgtatcttgagtgaactaaataggtaagctttaaataataatatgagcagtctat ttatatacattaaatattaaatgcattgtgagatgagcatagcctttgaggcccaggaacagaaagatttacttcacattgtaaa tatactggtatacatacaaacgtacatacnnnnnngtgtgtgtgtgtgtgtgtgtgtgtgtgtgcatgccatagcacacatgtg aagtccagagtacagcattctctttttctacctttctgtagattcttgtggtcagagtcaggtcaaatcaaatcagacagatgcat gtataaaatgctcttacccactgaaccatcttgctgcttggtccacaagcttagtggaagaatgctgggaagtgaatagtatgtt tttaaatgtagttaaccttgactttttgttgttgttgctgttattgaggccacattttcattgttctgagaaaatattactattttcctca gacagaattatatatttatttgaagttcatgaattccatattattttcctgtatttattacaaatagcatgcttaaacacttccaagta gtgaaacagctgctcatgtaggacacggattattgacagtgctgccatttatcagccagtaatccacttggcaggtagcacgct catcgttatcctttatgcacacaaagccttgtttgaattttatcttttaatgagtgtcaatgaaatggaaagagataagagttaaa aatacaacccaaactattgtatttacatttctcttttagaagaaacctaaagcagcattacttcttgcccatatttaataaataaca tcatttacccttgttccctgcctccagactctcccatatactcctctttcaattttattggcccctttaaatgacatatcattacatgtat atccctacacataagtataaccagttcagtttgtataatgttacttgcatgtgtgttttcaatgctgatcatttggtagtggataacc aatggtgtgccctatgaaggggcagagtatttgtatcatgcttagcattcctttgttgactgtaggattttgtttaaggttgaggtc tcttggtctttcccctgtctgcttctgcatgtccatggccatccttgttcagctcatgtttatgtagtcatgctgatgaggctttatgga tgtagcttctgacattgctaagcaacacagtctcagcaaactccccagtcctctggttcttacaatctttccacactgtttcaccatg ttgtctgagccttaggtgctgaagttgttttgtgtctgtatccattgggactaggctccacatgtctgcattttgattacttgtggtttt ctgtaacggtctctatgtgttgcaacgagaaggagtagttgctttgacgatgtgtaaagactatcttgtgggtataaggacaaat atttgcatgaagctatggattatgctggtctcaagcatgaactggataaattgtacagctcacacaaaacagctatagctagct gcacagtcaggcatgcactgatctgcttggggagttgttaaccaaagggcttacatagctatgtattttctaagctctagttttac tatcacaaagaaaattaattcacccttaattgtttaataagatgatatatcttagggaaaaaatgaaggtctttttttgacttatat aaaagcttatgttttctacagttttgtcaaaagaacatgcctcgcaagtcctggtgaggaagcgccgcgcaaataccttgcttga agaaactaaaaagggcaatcttgaaagagaatgcatcgaagagctctgc

Example 6 Gene Targeting of CHO Protein S Enhanced by Zinc Finger-Nuclease Fusion Proteins

Gene targeting by homologous recombination is hard and laborious work because the somatic recombinations that takes place in mammalian cells not very often are homologous. However, site-specific cleavage of the DNA strands can enhance homologous recombination. Engineering of DNA binding zinc fingers fused to endonucleases makes it possible to design almost exactly where the DNA cleavage should occur (Durai et al., Nucleic Acids Research, 2005; 33(18), 5970-5990 and Smith et al., Nucleic Acids Research, 2000; 28(17), 3361-3369). Two zinc finger proteins, designed to bind 5′-GTCCTGAGC-3′ (right finger) and 5′-GCTGGTATG-3′ (left finger) elements, were made in the framework published by Mani et al. (Mani et al., Biochemical and Biophysical Research Communications 2005, 335; 447-457). Zinc finger DNA binding specificity of zinc finger has previously been described by (Rebar-E3, et al. Nature Medicine 8 (2002) 1427-1432; Liu-P Q, et al. Journal of Biological Chemistry 276 (2001) 11323-11334; Ren-D, et al. Genes & Development 16 (2002) 27-32; Mani-M, et al Biochemical and Biophysical Research Communications 335 (2005) 447-457).

The right and left zinc finger were both either fused to the nuclease domain of Fok I and Sts I restriction enzyme (SEQ ID NO 44-51). Fok I and Sts I restriction enzyme needs to homodimerize to be able to cleave DNA, which also increase the specificity of the zinc finger pair. The function of the engineered nucleases are illustrated in FIGS. 5 and 6. When the genomic DNA has been cleaved by the zinc finger nucleases the repair mechanism will seek to repair the gap, very likely by homologous recombination. The gene targeting construct (SEQ ID NO 52) devoid of the Protein S gene will be transfected along with the nucleases. In the place of Protein S exon 1 in the genome the EGFP gene will be inserted. FIG. 6 illustrates the flow scheme of the homologous recombination. The gene targeting construct was made by exchanging the luciferase gene in the Protein S reporter construct (example 4) by the EGFP-gene and further inserting Protein S intron 1 after the poly A signal. The construct consists of Protein S promoter, EGFP-gene, PolyA-signal and Protein S intron1, no exon1. The homozygous recombinant CHO cell line will express EGFP and not Protein S because the Protein S signal peptide has been deleted and the transcript will be truncated right after the EGFP coding sequence due to the PolyA signal. Heterozygous cell clones are expected to be most abundant and a PCR analysis will reveal whether we have succeeded to make a homozygous cell clone. A heterozygous cell clone can be treated a second time with a targeting vector containing an antibiotical resistance gene in place of the EGFP gene to facilitate selection.

Left zinc finger-Fok I DNA sequence (SEQ ID NO 44) atgaagctactgtcttctatcgaacaagcatgcccaaaaaagaagagaaaggtagatgaaaaaccttacaagtgtccggaat gtgggaagtcctttagtcggagcgacaacctggcccggcaccagcggacgcataccggtgagaagccctacaaatgcccag aatgcggaaaatcattttcgcggagcagcaacctgcgggagcaccaacgaacccacacaggcgagaaaccatttaaatgtc ctgagtgtggtaagagctttagccggagcgacaacctgacccggcatcaagctactcatacgggcggcggtggcagcggtg gcggtagcggcggtggcagcggtggcggatcccaactagtcaaaagtgaactggaggagaagaaatctgaacttcgtcata aattgaaatatgtgcctcatgaatatattgaattaattgaaattgccagaaattccactcaggatagaattcttgaaatgaaggt aatggaattttttatgaaagtttatggatatagaggtaaacatttgggtggatcaaggaaaccggacggagcaatttatactgt cggatctcctattgattacggtgtgatcgtggatactaaagcttatagcggaggttataatctgccaattggccaagcagatga aatgcaacgatatgtcgaagaaaatcaaacacgaaacaaacatatcaaccctaatgaatggtggaaagtctatccatcttctg taacggaatttaagtttttatttgtgagtggtcactttaaaggaaactacaaagctcagcttacacgattaaatcatatcactaatt gtaatggagctgttcttagtgtagaagagcttttaattggtggagaaatgattaaagccggcacattaaccttagaggaagtga gacggaaatttaataacggcgagataaacttttag Left zinc finger-Sts I DNA sequence (SEQ ID NO 45) atgaagctactgtcttctatcgaacaagcatgcccaaaaaagaagagaaaggtagatgaaaaaccttacaagtgtccggaat gtgggaagtcctttagtcggagcgacaacctggcccggcaccagcggacgcataccggtgagaagccctacaaatgcccag aatgcggaaaatcattttcgcggagcagcaacctgcgggagcaccaacgaacccacacaggcgagaaaccatttaaatgtc ctgagtgtggtaagagctttagccggagcgacaacctgacccggcatcaagctactcatacgggcggcggtggcagcggtg gcggtagcggcggtggcagcggtggcggatccgtattagaaaaaagtgatattgaaaaatttaagaatcaattgcgtacgga actaaccaatattgaccattcttatcttaaaggaattgatatagctagtaaaaagaaaaccagtaatgttgaaaatacggaattt gaagcaatatcaaccaagatttttacggatgagttgggtttttcaggcaaacatctaggaggaagcaacaaaccagatggact cctgtgggatgatgattgtgcaattattcttgattcaaaagcttactcagaaggctttccactcactgcctcccacacagatgctat gggaagatatttgaggcaatttacagagcgaaaagaagaaataaagccaacgtggtgggatattgctccagaacatttaga caatacatatttcgcttacgtttctgggagtttttcgggtaattataaggaacagttacaaaaatttaggcaagatacaaaccatt taggtggggcactagagtttgttaaattgttattactagcaaataattataaaactcaaaaaatgagtaaaaaagaagttaag aaaagtattcttgattataatatttcatatgaagaatatgctccattacttgcagaaatagagtaa Right zinc finger-Fok I DNA sequence (SEQ ID NO 46) atgaagctactgtcttctatcgaacaagcatgcccaaaaaagaagagaaaggtagatgaaaaaccttacaagtgtccggaat gtgggaagtcctttagtcggagcgacgccctgacccagcaccagcggacgcataccggtgagaagccctacaaatgcccag aatgcggaaaatcattttcgcagagcagccacctggcccggcaccaacgaacccacacaggcgagaaaccatttaaatgtcc tgagtgtggtaagagctttagccagagcagccacctgacccggcatcaagctactcatacgggcggcggtggcagcggtggc ggtagcggcggtggcagcggtggcggatcccaactagtcaaaagtgaactggaggagaagaaatctgaacttcgtcataaa ttgaaatatgtgcctcatgaatatattgaattaattgaaattgccagaaattccactcaggatagaattcttgaaatgaaggtaa tggaattttttatgaaagtttatggatatagaggtaaacatttgggtggatcaaggaaaccggacggagcaatttatactgtcg gatctcctattgattacggtgtgatcgtggatactaaagcttatagcggaggttataatctgccaattggccaagcagatgaaat gcaacgatatgtcgaagaaaatcaaacacgaaacaaacatatcaaccctaatgaatggtggaaagtctatccatcttctgtaa cggaatttaagtttttatttgtgagtggtcactttaaaggaaactacaaagctcagcttacacgattaaatcatatcactaattgta atggagctgttcttagtgtagaagagcttttaattggtggagaaatgattaaagccggcacattaaccttagaggaagtgaga cggaaatttaataacggcgagataaacttttag Right zinc finger-Sts I DNA sequence (SEQ ID NO 47) atgaagctactgtcttctatcgaacaagcatgcccaaaaaagaagagaaaggtagatgaaaaaccttacaagtgtccggaat gtgggaagtcctttagtcggagcgacgccctgacccagcaccagcggacgcataccggtgagaagccctacaaatgcccag aatgcggaaaatcattttcgcagagcagccacctggcccggcaccaacgaacccacacaggcgagaaaccatttaaatgtcc tgagtgtggtaagagctttagccagagcagccacctgacccggcatcaagctactcatacgggcggcggtggcagcggtggc ggtagcggcggtggcagcggtggcggatccgtattagaaaaaagtgatattgaaaaatttaagaatcaattgcgtacggaac taaccaatattgaccattcttatcttaaaggaattgatatagctagtaaaaagaaaaccagtaatgttgaaaatacggaatttg aagcaatatcaaccaagatttttacggatgagttgggtttttcaggcaaacatctaggaggaagcaacaaaccagatggactc ctgtgggatgatgattgtgcaattattcttgattcaaaagcttactcagaaggctttccactcactgcctcccacacagatgctat gggaagatatttgaggcaatttacagagcgaaaagaagaaataaagccaacgtggtgggatattgctccagaacatttaga caatacatatttcgcttacgtttctgggagtttttcgggtaattataaggaacagttacaaaaatttaggcaagatacaaaccatt taggtggggcactagagtttgttaaattgttattactagcaaataattataaaactcaaaaaatgagtaaaaaagaagttaag aaaagtattcttgattataatatttcatatgaagaatatgctccattacttgcagaaatagagtaa Left zinc finger-Fok I protein sequence (SEQ ID NO 48) mkllssieqacpkkkrkvdekpykcpecgksfsrsdnlarhqrthtgekpykcpecgksfsrssnlrehqrthtgekpfkcpe cgksfsrsdnltrhqathtggggsgggsgggsgggsqlvkseleekkselrhklkyvpheyielieiarnstqdrilemkvmef fmkvygyrgkhlggsrkpdgaiytvgspidygvivdtkaysggynlpigqademqryveenqtrnkhinpnewwkvyps svtefkflfvsghfkgnykaqltrlnhitncngavlsveelliggemikagtltleevrrkfnngeinf Left zinc finger-Sts I protein sequence (SEQ ID NO 49) mkllssieqacpkkkrkvdekpykcpecgksfsrsdnlarhqrthtgekpykcpecgksfsrssnlrehqrthtgekpfkcpe cgksfsrsdnltrhqathtggggsgggsgggsgggsvleksdiekfknqlrteltnidhsylkgidiaskkktsnventefeaist kiftdelgfsgkhlggsnkpdgllwdddcaiildskaysegfpltashtdamgrylrqfterkeeikptwwdiapehidntyfay vsgsfsgnykeqlqkfrqdtnhlggalefvkllllannyktqkmskkevkksildynisyeeyapllaeie Right zinc finger-Fok I protein sequence (SEQ ID NO 50) mkllssieqacpkkkrkvdekpykcpecgksfsrsdaltqhqrthtgekpykcpecgksfsqsshlarhqrthtgekpfkcpe cgksfsqsshltrhqathtggggsgggsgggsgggsqlvkseleekkselrhklkyvpheyielieiarnstqdrilemkvmef fmkvygyrgkhlggsrkpdgaiytvgspidygvivdtkaysggynlpigqademqryveenqtrnkhinpnewwkvyps svtefkflfvsghfkgnykaqltrlnhitncngavlsveelliggemikagtltleevrrkfnngeinf Right zinc finger-Sts I protein sequence (SEQ ID NO 51) mkllssieqacpkkkrkvdekpykcpecgksfsrsdaltqhqrthtgekpykcpecgksfsqsshlarhqrthtgekpfkcpe cgksfsqsshltrhqathtggggsgggsgggsgggsvleksdiekfknqlrteltnidhsylkgidiaskkktsnventefeaist kiftdelgfsgkhlggsnkpdgllwdddcaiildskaysegfpltashtdamgrylrqfterkeeikptwwdiapehldntyfay vsgsfsgnykeqlqkfrqdtnhlggalefvkllllannyktqkmskkevkksildynisyeeyapllaeie Protein S promoter-EGFP-Protein S intron 1 targeting construct (SEQ ID NO 52) ggtaccgagctcttacgcgtgctagcccgggctcgagatctcaaccccttttgaccatacacatttctactctttgtgtttgctgga gctgttttctccccacactcaaccccctttgctgaagcctggaacttgctttccacagcttaagttgttataggtttcaatcatctgtc cacctccctgactttcataattttgtgaaatacccttgcatatatatatgggactaaatattattttctcctggttgtccataatagat taatttaattcctaaacaaagaacagaacatagattggtatagtagaagagtttcccttctccctactgcatgaatggaaattcc ccaaaccatccttatcagagaaattaactcacatactagtcacctttcattcagctggatgacaaaatcattttaaaaaaagaga ataaagaaaacagataagaacaactagatctaggaataatacttaaaatatgattctgcttagtaggtttcattcacacaccta gaaaaaaaaatcagtcaatgtttcctttgggcagaaaatgagcaataatgggtatgcattgaccactactgttggacatagcct tattgcttcatatagcatctattcaaagtctcagatcaacactatgaaaacctgtcatctctgtattagatgatgtgactggggctg taaagggtaagctcttttcttacagctatacaacaacgctaagaccaagttctgtgctttgagcccaggcagtttagtttcccagg agcaacctaaagcctgattcacaggcatatgtatgatccaaactgaatggtagtacatcaataccaaaacaatctattggtgga aacacaccataggtgatcgaaatactccattttcttttcctctcatgacttctgttctgagcagtcctcttcctaaagtctacattgtc ttctgagttcaggctgacatcttgacatcctcctggctggcacagtctctggacaaggagggaagaaggagagaaggggaaa gggagaggagggggggagggagagaaagaatgggaagaggaaggatatgaaagagagaagagaggagggaaggcg ggaggaagggagggagggagggagggagagagggagagagaggagagagagagagagagagagagagagagag agagagagagagagagagagagagagggagagggagagagagacagagagagagagagggagagggagagagag agagagagagagagagagagagagagagagagagagtgaggagagagagagagagttttcttcaccattggacattcct aaagaaaagaagtaaatgcaggattggggacagtgacagaggacctctgataaactttctgaggcctctgacctcactctctc ggagccctcctccaccacccaccccccccctccctagctgagaaaagcttccaggaaatgtcccagtcatcgcttcccctcccgg gctgggggctgggagcgggcggtcccctcaggccagggctgctccggccgcgctcgggcagggccacaacagagctggga aagctgagcccaggctcgcagctcctctgggcggagcgccggctcggtccccgctgcgccagccgtgatccccggcagcctg ctcagccatggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacgg ccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccggc aagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaag cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaa gacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacg gcaacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggca tcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccat cggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaagcg cgatcacatggtcctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaaagcggccgc gactctagagtcggggcggccggccgcttcgagcagacatgataagatacattgatgagtttggacaaaccacaactagaat gcagtgaaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattataagctgcaataaacaagttaaca acaacaattgcattcattttatgtttcaggttcagggggaggtgtgggaggttttttaaagcaagtaaaacctctacaaatgtgg taaaatcgataaggatcctgctggtatgcctagccctggtgctgccagcctcggagacaaactgtaagtaatccatacctcctg gcttctccattccctatgtgccccggcttgaagattttccactaggctgtttgctgcctcctaagtttccagtaagtccgccaccatt cagagagtcgcggcagcctgggtctggtgggcagtgtaaaggtgggacaggatcaaagcttgccttgctttgagaaccattgt ccacaggacttgattccagaacccgggtgacactaagtgtcaaaggaattgcttgaacatagtcctaaatattgctaggaaag ctaagtcaagcctgttgccctcctcccgtttacaagagtgccccagcccgcaccctctcctgcggctaaccttccttttgcaatttct ggactttgaacttgattgactggtctcacattgacaaactgtttggggactgctggggtgttacatatgattctctaaccttgatat aagaaatagctgttggatgttaccttgtaccgaggatcattttctgagggttttgactgttgccgctttgagatggcagcaagaa ttctgtacaacacacacatttttgtgtttcttggtctttcctcttcccattctcagattccgggcagtatatcgagttttctcttagaaat ataaaacgaaccacaaggttttagtacattttaatggtcaattaaattgtttttagaagcttaaatatgttcataattaacactgct ttcttttgctcttttgtagtcccagtcactggcatgggagcaataactgtataacaaataccacttaggtcactgcgagcaccaaa gaaacttttcaaagatggtaattaagtaggagtttgctggaattgcaagtttttattaattagtaaggaatctagcctgatattttt aaatgtctaactaagttaaagaccagaatgaaactggttcactttttattgaggataaacaagttacagttataaagcctcaac aatcaaagccctacgatgaagcagcgtgtgactgtatgcacatgatctatcttgttcagaggaacaatcaaacattttcagata gcatcagggcggtggtggtactcgcctataatcctagcaaagtcagaggcaagcagatctctgtgttcaaggccagcctagtc tacagagtgagttccaggacaactggggctacacagagaaacctgtctcagagaaaaacaaaataaaaccaaattcagata gctggtgtttgggaaaagagcaaaagacagcagtgctggccacacagagagtagacaagttcattctacaaggacatcaca gaaagaatatgtgacccaatgacgaccataaactttcttgttcctgtgtcaaattatctccggtttattgatgaagaaccagaca ctatgagctgcgtctcctccttaagattttgttttggtgtcttgtttttgtcaaggggtttcattgtggccctgagcattagatccagg gctttgtgcatgctaggccagggagctatattcccgaactccagaagactaggaatttgagatataaatagaatttgaattacct tctgtacaattgattgtatggttctagaaatattgctatattaagggaagcctttgcagaagacagttattttgagatggtgcata acacaaaagaaatgaactaaagcctgaggcctgctctgtagctctgccttgcccttagcctacaataactttctttacctttcaag catgtgccaccacgcctgactttcaggcccttcattttaacaagaaagcaagtattcagttatcaactgactttccaaatgcatttg tatgaataaaaactacaaaaatataaaaataagaactatacacacaaaagccttgtatttaaaatttacgctgtggacatatttt gctcatcattcgtgagagcttgcggtaaaaaggcaaaggggaagaggaggatatctattttgggtaggctaatttggccttatc cagacttcccttttgggtggatgcagtctgcccagcacactattggcccatttcttctacatggctttgtgctctgctctgcccttag ctaattgtcccctttgacatgcttttgtctttccttaaagtttctatacttcaaaaaccatcccgctacactaatggagtgattttctca agggttgctttatgtttggggtttgtactgcaagagttagtttctgatatagcaatggtgatagtatagtcttctaccatgaactct atgccagcaagtacaggggtatatttcacatgggtgttttctgttcactgagtttcatgtcttctttgtatctttttgttttgttttgtga gacagggtttctctgtagcttttgagtcagtcctggaacttgctggccggccttgaactcacagagattcacctgcctctgcctcc caagtgctgggatttaaggtgtgagtcaccactgccaggttttttctttgtatcttgagtgaactaaataggtaagctttaaataat aatatgagcagtctatttatatacattaaatattaaatgcattgtgagatgagcatagcctttgaggcccaggaacagaaagat ttacttcacattgtaaatatactggtatacatacaaacgtacatacnnnnnngtgtgtgtgtgtgtgtgtgtgtgtgtgtgcatgc catagcacacatgtgaagtccagagtacagcattctctttttctacctttctgtagattcttgtggtcagagtcaggtcaaatcaaa tcagacagatgcatgtataaaatgctcttacccactgaaccatcttgctgcttggtccacaagcttagtggaagaatgctggga agtgaatagtatgtttttaaatgtagttaaccttgactttttgttgttgttgctgttattgaggccacattttcattgttctgagaaaat attactattttcctcagacagaattatatatttatttgaagttcatgaattccatattattttcctgtatttattacaaatagcatgctta aacacttccaagtagtgaaacagctgctcatgtaggacacggattattgacagtgctgccatttatcagccagtaatccacttgg caggtagcacgctcatcgttatcctttatgcacacaaagccttgtttgaattttatcttttaatgagtgtcaatgaaatggaaaga gataagagttaaaaatacaacccaaactattgtatttacatttctcttttagaagaaacctaaagcagcattacttcttgcccatat ttaataaataacatcatttacccttgttccctgcctccagactctcccatatactcctctttcaattttattggcccctttaaatgacat atcattacatgtatatccctacacataagtataaccagttcagtttgtataatgttacttgcatgtgtgttttcaatgctgatcatttg gtagtggataaccaatggtgtgccctatgaaggggcagagtatttgtatcatgcttagcattcctttgtcgaccgatgcccttga gagccttcaacccagtcagctccttccggtgggcgcggggcatgactatcgtcgccgcacttatgactgtcttctttatcatgcaa ctcgtaggacaggtgccggcagcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggt atcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggcca gcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatc gacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctct cctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtagg tatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccg gtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcg aggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgct ctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttg tttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtgga acgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagtttt aaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtct atttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgca atgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagt ggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcg caacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaa ggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgc agtgttatcactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagta ctcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccaca tagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagt tcgatgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggc aaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcattta tcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccga aaagtgccacctgacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttg ccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggg gctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccat cgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactca accctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatt taacgcgaattttaacaaaatattaacgcttacaatttgccattcgccattcaggctgcgcaactgttgggaagggcgatcggtg cgggcctcttcgctattacgccagcccaagctaccatgataagtaagtaatattaaggtacgggaggtacttggagcggccgc aataaaatatctttattttcattacatctgtgtgttggttttttgtgtgaatcgatagtactaacatacgctctccatcaaaacaaaa cgaaacaaaacaaactagcaaaataggctgtccccagtgcaagtgcaggtgccagaacatttctctatcgata. 

1. An amino acid sequence comprising the sequence of SEQ ID NO
 3. 